Abstract. We investigate the complexity of abduction, a fundamental and
important form of non-monotonic reasoning. Given a knowledge base
explaining the world's behavior, abduction aims at finding an explanation
for some observed manifestation. We consider propositional abduction,
where the knowledge base and the manifestation are represented by
propositional formulae. The problem of deciding whether there exists an
explanation has been shown to be $\Sigma^p_2$-complete
in general. We focus on formulae in which the allowed connectives are
taken from certain sets of Boolean functions. We consider different variants
of the abduction problem by restricting both the manifestations and
the hypotheses. For all these variants we obtain a complexity classifica- 
tion for all possible sets of Boolean functions. In this way, we identify 
easier cases, namely NP-complete, coNP-complete and polynomial cases. 
Thus, we get a detailed picture of the complexity of the propositional 
abduction problem, hence highlighting sources of intractability. Further, 
we address the problem of counting the explanations and draw a com- 
plete picture for the counting complexity. 
1 Introduction



This paper is dedicated to the computational complexity of abduction, a
fundamental and important form of non-monotonic reasoning. Given some
consistent knowledge about the world, abductive reasoning is used to
generate explanations for observed manifestations (or at least to tell
whether one exists). Abduction has taken on fundamental importance in
Artificial Intelligence and has many application areas, spanning medical
diagnosis [BATJ89], text analysis [HSAM93], system diagnosis [SW01],
configuration problems [AFM02], and temporal knowledge bases [BL00]; it
also has connections to default reasoning [SL90].

There are several approaches to formalize the problem of abduction. In this
paper, we focus on logic based abduction in which the knowledge base is given as
a set $\Gamma$ of propositional formulae. We are interested in deciding whether
there exists an explanation $E$, i.e., a set of literals consistent with $\Gamma$
such that $\Gamma$ and $E$ together entail the observation.

From a complexity theoretic viewpoint, the abduction problem is very hard
since it is $\Sigma^p_2$-complete and thus situated at the second level of the
polynomial hierarchy [EG95]. This intractability result raises the question of
which restrictions lead to fragments of lower complexity. Several such
restrictions have been considered in previous works. One of the most famous
amongst those is Schaefer's framework, where formulae are restricted to
generalized conjunctive normal form with clauses from a fixed set of relations
[CZ06, NZ05, NZ08].

A similar yet different procedure is to require formulae to be constructed
from a restricted set of Boolean functions $B$. Such formulae are called
$B$-formulae. This approach was first taken by Lewis, who showed that the
satisfiability problem is NP-complete if and only if this set of Boolean
functions has the ability to express the negation of implication connective
$\nrightarrow$ [Lew79]. Since then, this approach has been applied to a wide
range of problems including equivalence and implication problems
[Rei03, BMTV09a], satisfiability and model checking in modal and temporal
logics [BHSS06, BSS+08], default logic [BMTV09b], and circumscription
[Tho09], among others.

We follow this approach and show that Post's lattice allows us to classify
completely the complexity of propositional abduction for several variants and all
possible sets of allowed Boolean functions. We consider two main variants of the 
abduction problem. In the first one we may build explanations from positive 
and negative literals. We refer to this problem as symmetric abduction, Abd for 
short. The second variant, P-Abd, is the so-called positive abduction where we 
allow only positive literals in the explanations. 

We first examine the symmetric variant in the case where the manifestation
is a positive literal. We show that, depending on the set $B$ of allowed
connectives, the abduction problem is either $\Sigma^p_2$-complete, or
NP-complete, or in P and $\oplus$LOGSPACE-hard, or in LOGSPACE. More
precisely, we prove that the complexity of this abduction problem is
$\Sigma^p_2$-complete as soon as $B$ can express one of the functions
$x \vee (y \wedge \neg z)$, $x \wedge (y \vee \neg z)$ or
$(x \wedge y) \vee (x \wedge \neg z) \vee (y \wedge \neg z)$. It drops to
NP-complete when all functions in $B$ are monotonic and $B$ can express
one of the functions $x \vee (y \wedge z)$, $x \wedge (y \vee z)$ or
$(x \wedge y) \vee (x \wedge z) \vee (y \wedge z)$. The problem becomes
solvable in polynomial time and is $\oplus$LOGSPACE-hard if $B$-formulae
may depend on more than one variable while being representable as linear
equations. Finally, the complexity drops to LOGSPACE in all remaining
cases. We then complete our study of symmetric abduction with analogous
complexity classifications of the variants of Abd obtained by restricting
the manifestation to be a clause, a term or a $B$-formula, respectively.

These results are subsequently extended to positive abduction. An overview 
can be found in Figures 1 and 2. 

Please note that in [CZ06], the authors obtained a complexity classification
of the abduction problem in Schaefer's framework. The two classifications
are in the same vein since they classify the complexity of abduction for
local restrictions on the knowledge base. However, the two results are
incomparable in the sense that no classification can be deduced from the
other. They only overlap in the particular case of the linear connective
$\oplus$, for which both types of sets of formulae can be seen as systems of
linear equations. This special abduction case has been shown to be decidable
in polynomial time in [Zan03].

Besides the decision problem, another natural question concerns the number
of explanations, that is, the counting problem for abduction. The study of
the counting complexity of abduction was started by Hermann and Pichler
[HP07]. We prove here a trichotomy theorem showing that counting the full
explanations of symmetric abduction is either $\#\cdot$coNP-complete or
#P-complete or in FP, depending on the set $B$ of allowed connectives. We
also consider the counting problem associated with positive abduction, for
which we distinguish two frequently used settings: counting all positive
explanations, or counting only the subset-minimal ones. For both
formalizations of the counting problem, we get a potentially dichotomous
classification with one open case.

The rest of the paper is structured as follows. We first give the necessary 
preliminaries in Section 2. The abduction problems considered herein are defined 
in Section 3. In Section 4 we classify the complexity of symmetric abduction, 
where we first consider the case where the manifestation is a single positive literal 
(Section 4.1), and then turn to variants in which the manifestations are clauses, 
terms and restricted formulae (Section 4.2). Section 5 then studies the complexity 
of positive abduction. An overview of these results is given in Section 6. Finally, 
Section 7 is dedicated to the counting problem, and Section 8 contains some 
concluding remarks. 

2 Preliminaries 

Complexity Theory. We require standard notions of complexity theory. For
the decision problems, the arising complexity degrees encompass the classes
LOGSPACE, P, NP, and $\Sigma^p_2$. For more background information, the
reader is referred to [Pap94]. We furthermore require the class
$\oplus$LOGSPACE, defined as the class of languages $L$ such that there
exists a nondeterministic logspace Turing machine that exhibits an odd
number of accepting paths if and only if $x \in L$, for all $x$ [BDHM92].
It holds that LOGSPACE $\subseteq$ $\oplus$LOGSPACE $\subseteq$ P. For our
hardness results we employ logspace many-one reductions, defined as
follows: a language $A$ is logspace many-one reducible to some language
$B$ (written $A \le^{\log}_m B$) if there exists a logspace-computable
function $f$ such that $x \in A$ if and only if $f(x) \in B$.

A counting problem is represented using a witness function $w$, which for
every input $x$ returns a finite set of witnesses. This witness function
gives rise to the following counting problem: given an instance $x$, find
the cardinality $|w(x)|$ of the witness set $w(x)$. The class #P is the
class of counting problems naturally associated with decision problems in
NP. According to [HV95], if $\mathcal{C}$ is a complexity class of decision
problems, we define $\#\cdot\mathcal{C}$ to be the class of all counting
problems whose witness function is such that the size of every witness $y$
of $x$ is polynomially bounded in the size of $x$, and checking whether
$y \in w(x)$ is in $\mathcal{C}$. Thus, we have
$\#\mathrm{P} = \#\cdot\mathrm{P}$ and
$\#\mathrm{P} \subseteq \#\cdot\mathrm{coNP}$. Completeness of counting
problems is usually proved by means of Turing reductions. A stronger notion
is the parsimonious reduction, where the exact number of solutions is
conserved by the reduction function.

Propositional formulae. We assume familiarity with propositional logic.
The set of all propositional formulae is denoted by $\mathcal{L}$. A model
for a formula $\varphi$ is a truth assignment to the set of its variables
that satisfies $\varphi$. Further, we denote by $\varphi[\alpha/\beta]$ the
formula obtained from $\varphi$ by replacing all occurrences of $\alpha$
with $\beta$. For a given set $\Gamma$ of formulae, we write
Vars($\Gamma$) to denote the set of variables occurring in $\Gamma$. We
identify finite $\Gamma$ with the conjunction of all the formulae in
$\Gamma$, $\bigwedge_{\varphi \in \Gamma} \varphi$. Naturally,
$\Gamma[\alpha/\beta]$ then stands for
$\bigwedge_{\varphi \in \Gamma} \varphi[\alpha/\beta]$. For any formula
$\varphi \in \mathcal{L}$, we write $\Gamma \models \varphi$ if $\Gamma$
entails $\varphi$, i.e., if every model of $\Gamma$ also satisfies
$\varphi$.

A literal $l$ is a variable $x$ or its negation $\neg x$. Given a set of
variables $V$, Lits($V$) denotes the set of all literals formed upon the
variables in $V$, i.e.,
$\mathrm{Lits}(V) := V \cup \{\neg x \mid x \in V\}$. A clause is a
disjunction of literals and a term is a conjunction of literals.

Clones of Boolean Functions. A clone is a set of Boolean functions that is
closed under superposition, i.e., it contains all projections (that is, the
functions $f(a_1, \dots, a_n) = a_k$ for all $1 \le k \le n$ and
$n \in \mathbb{N}$) and is closed under arbitrary composition. Let $B$ be a
finite set of Boolean functions. We denote by $[B]$ the smallest clone
containing $B$ and call $B$ a base for $[B]$. In 1941 Post identified the
set of all clones of Boolean functions [Pos41]. He gave a finite base for
each of the clones and showed that they form a lattice under the usual
$\subseteq$-relation, hence the name Post's lattice (see, e.g., Figure 1).
To define the clones we introduce the following notions, where $f$ is an
$n$-ary Boolean function:

- $f$ is $c$-reproducing if $f(c, \dots, c) = c$, $c \in \{0, 1\}$.

- $f$ is monotonic if $a_1 \le b_1, \dots, a_n \le b_n$ implies
  $f(a_1, \dots, a_n) \le f(b_1, \dots, b_n)$.

- $f$ is $c$-separating of degree $k$ if for all $A \subseteq f^{-1}(c)$ of
  size $|A| = k$ there exists an $i \in \{1, \dots, n\}$ such that
  $(a_1, \dots, a_n) \in A$ implies $a_i = c$, $c \in \{0, 1\}$.

- $f$ is $c$-separating if $f$ is $c$-separating of degree $|f^{-1}(c)|$.

- $f$ is self-dual if $f \equiv \mathrm{dual}(f)$, where
  $\mathrm{dual}(f)(x_1, \dots, x_n) := \neg f(\neg x_1, \dots, \neg x_n)$.

- $f$ is affine if $f \equiv x_1 \oplus \cdots \oplus x_n \oplus c$ with
  $c \in \{0, 1\}$.
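
For small arities, several of the properties listed above can be checked by exhaustive evaluation. The following is a minimal illustrative sketch, not part of the original development: the function names are hypothetical, Boolean functions are assumed to be Python callables on the values 0 and 1, and the affine test is read in the usual sense that some arguments may be absent from the XOR (an assumption; the definition above lists all arguments). The separating properties are omitted.

```python
from itertools import product

def is_c_reproducing(f, n, c):
    """f(c, ..., c) == c."""
    return f(*([c] * n)) == c

def is_monotonic(f, n):
    """a <= b componentwise implies f(a) <= f(b)."""
    points = list(product((0, 1), repeat=n))
    return all(
        f(*a) <= f(*b)
        for a in points
        for b in points
        if all(x <= y for x, y in zip(a, b))
    )

def is_self_dual(f, n):
    """f(x) equals the negation of f applied to the negated input."""
    return all(
        f(*a) == 1 - f(*(1 - x for x in a))
        for a in product((0, 1), repeat=n)
    )

def is_affine(f, n):
    """f equals c XOR (XOR of a subset of its arguments)."""
    c = f(*([0] * n))
    # Coefficient of x_i: does setting only x_i to 1 change the value?
    coeff = [f(*[1 if j == i else 0 for j in range(n)]) ^ c for i in range(n)]
    return all(
        f(*a) == c ^ (sum(k & x for k, x in zip(coeff, a)) % 2)
        for a in product((0, 1), repeat=n)
    )
```

For instance, the ternary majority function passes the monotonicity and self-duality checks but not the affine one, whereas $x \oplus y \oplus z$ is affine and self-dual but not monotonic.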

A list of all clones with definitions and finite bases is given in Table 1 on
page 24. A propositional formula using only functions from $B$ as connectives is
called a $B$-formula. The set of all $B$-formulae is denoted by $\mathcal{L}(B)$.

Let $f$ be an $n$-ary Boolean function. A $B$-formula $\varphi$ such that
$\mathrm{Vars}(\varphi) = \{x_1, \dots, x_n, y_1, \dots, y_m\}$ is a
$B$-representation of $f$ if for all
$a_1, \dots, a_n, b_1, \dots, b_m \in \{0, 1\}$ it holds that
$f(a_1, \dots, a_n) = 1$ if and only if every
$\sigma\colon \mathrm{Vars}(\varphi) \to \{0, 1\}$ with $\sigma(x_i) = a_i$
and $\sigma(y_i) = b_i$ for all relevant $i$ satisfies $\varphi$. Such a
$B$-representation exists for every $f \in [B]$. Yet, it may happen that the
$B$-representation of some function uses some input variable more than once.

Example 2.1. Let $h(x, y) = x \wedge \neg y$. An $\{h\}$-representation of the
function $x \wedge y$ is $h(x, h(x, y))$.
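
The claim of Example 2.1 can be confirmed by evaluating both sides on all four inputs. A short sketch (the names `h`, `and_via_h` and `agrees` are chosen here for illustration only):

```python
from itertools import product

def h(x, y):
    """h(x, y) = x AND NOT y, on the values 0 and 1."""
    return x & (1 - y)

def and_via_h(x, y):
    """The {h}-representation h(x, h(x, y)) from Example 2.1."""
    return h(x, h(x, y))

# Compare with conjunction on every input; note that x is used twice.
agrees = all(and_via_h(x, y) == (x & y) for x, y in product((0, 1), repeat=2))
```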



3 The Abduction Problem 

Let $B$ be a finite set of Boolean functions. We are interested in the
propositional abduction problem parameterized by the set $B$ of allowed
connectives. We define the abduction problem for $B$-formulae as

Problem: Abd($B$)

Instance: $\mathcal{P} = (\Gamma, A, \varphi)$, where

- $\Gamma \subseteq \mathcal{L}(B)$ is a set of $B$-formulae,

- $A \subseteq \mathrm{Vars}(\Gamma)$ is a set of variables,

- $\varphi \in \mathcal{L}$ is a formula with
  $\mathrm{Vars}(\varphi) \subseteq \mathrm{Vars}(\Gamma) \setminus A$.

Question: Is there a set $E \subseteq \mathrm{Lits}(A)$ such that
$\Gamma \wedge E$ is satisfiable and $\Gamma \wedge E \models \varphi$ (or
equivalently $\Gamma \wedge E \wedge \neg\varphi$ is unsatisfiable)?
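
To make the definition concrete, here is a brute-force sketch of the problem, exponential in the number of variables and meant only as an illustration. The representation is an assumption made here: formulae are Python callables on an assignment dictionary, and a candidate set of literals is a dictionary from hypotheses to truth values. All function names are hypothetical.

```python
from itertools import product

def assignments(variables):
    """Yield every truth assignment over `variables` as a dict."""
    variables = sorted(variables)
    for values in product((False, True), repeat=len(variables)):
        yield dict(zip(variables, values))

def is_explanation(gamma, variables, literals, phi):
    """`literals` maps some hypotheses to truth values (the set E).

    E is an explanation iff Gamma and E are jointly satisfiable and every
    model of Gamma and E also satisfies phi.
    """
    models = [
        s for s in assignments(variables)
        if all(s[v] == b for v, b in literals.items())
        and all(g(s) for g in gamma)
    ]
    return bool(models) and all(phi(s) for s in models)

def full_explanations(gamma, variables, hypotheses, phi):
    """All explanations that fix the truth value of every hypothesis."""
    return [
        e for e in assignments(hypotheses)
        if is_explanation(gamma, variables, e, phi)
    ]
```

With the single formula $a \to q$ as knowledge base, hypotheses $\{a\}$ and manifestation $q$, the only full explanation found is the one setting $a$ to true.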

The set $\Gamma$ represents the knowledge base. The set $A$ is called the
set of hypotheses and $\varphi$ is called manifestation or query.
Furthermore, if such a set $E$ exists, it is called an explanation or a
solution of the abduction problem. It is called a full explanation if
$\mathrm{Vars}(E) = A$. Observe that every explanation can be extended to a
full one. We will consider several restrictions of the manifestations of
this problem. To indicate them, we introduce a second argument
$\mathcal{M}$ meaning that $\varphi$ is required to be

- Q (resp. PQ, NQ): a single literal (resp. positive literal, negative literal),

- C (resp. PC, NC): a clause (resp. positive clause, negative clause),

- T (resp. PT, NT): a term (resp. positive term, negative term),

- $\mathcal{L}(B)$: a $B$-formula.

We refer to the above defined abduction problem as symmetric abduction, since 
every variable of the hypotheses A may be taken positive or negative to construct 
an explanation. We will also consider positive abduction, where we are interested 
in purely positive explanations only. To indicate this, we add the prefix "P-". 
Thus, for an instance $(\Gamma, A, \varphi)$ of P-Abd($B$, $\mathcal{M}$), every
solution $E$ of $(\Gamma, A, \varphi)$ has to satisfy $E \subseteq A$.

The following important lemma makes clear the role of the constants in our
abduction problem. It often reduces the number of cases to be considered.

Lemma 3.1. Let $B$ be a finite set of Boolean functions.

1. If $\mathcal{M} \in \{\mathrm{Q}, \mathrm{C}, \mathrm{T}, \mathcal{L}(B)\}$, then

   Abd($B$, $\mathcal{M}$) $\equiv^{\log}_m$ Abd($B \cup \{1\}$, $\mathcal{M}$),
   P-Abd($B$, $\mathcal{M}$) $\equiv^{\log}_m$ P-Abd($B \cup \{1\}$, $\mathcal{M}$).

2. If $\mathcal{M} \in \{\mathrm{Q}, \mathrm{C}, \mathrm{T}\}$ and $\vee \in [B]$, then

   Abd($B$, $\mathcal{M}$) $\equiv^{\log}_m$ Abd($B \cup \{0\}$, $\mathcal{M}$).

Proof. To reduce Abd($B \cup \{1\}$, $\mathcal{M}$) to Abd($B$, $\mathcal{M}$) we
transform any instance of the first problem by replacing every occurrence of 1
by a fresh variable $t$ and adding the unit clause $(t)$ to the knowledge base.
The same reduction works for P-Abd. To prove
Abd($B \cup \{0\}$, $\mathcal{M}$) $\equiv^{\log}_m$ Abd($B$, $\mathcal{M}$), let
$\mathcal{P} = (\Gamma, A, \psi)$ be an instance of the first problem and $f$ be
a fresh variable. Since
$\mathcal{M} \in \{\mathrm{Q}, \mathrm{C}, \mathrm{T}\}$, we can suppose w.l.o.g.
that $\psi$ does not contain 0. We map $\mathcal{P}$ to
$\mathcal{P}' = (\Gamma', A', \psi)$, where $\Gamma'$ is the $B$-representation of
$\{\varphi[0/f] \vee f \mid \varphi \in \Gamma\}$ and $A' = A \cup \{f\}$. □

Of course this lemma also holds for purely positive/negative queries, clauses
or terms, i.e., Q, C, T can be replaced by PQ, PC, PT or NQ, NC, NT,
respectively.

Observe that if $B_1$ and $B_2$ are two sets of Boolean functions such that
$B_1 \subseteq [B_2]$, then every function of $B_1$ can be expressed by a
$B_2$-formula, namely by its $B_2$-representation. This way there is a
canonical reduction between Abd($B_1$, $\mathcal{M}$) and
Abd($B_2$, $\mathcal{M}$) if $B_1 \subseteq [B_2]$: replace all
$B_1$-connectives by their $B_2$-representation. Note that this reduction is
not necessarily polynomial: since the $B_2$-representation of some function
may use some input variable more than once, the formula size may grow
exponentially. Nevertheless we will use this reduction very frequently,
avoiding an exponential blow-up by special structures of the $B_1$-formulae.
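
The effect of a repeated input variable can be seen on the representation $h(x, h(x, y))$ of conjunction from Example 2.1. The sketch below is an illustration added here with hypothetical helper names: it builds the representation of an $n$-fold conjunction as a string and counts the occurrences of $h$. Nesting on the duplicated argument doubles the size at every level, whereas nesting on the other argument keeps it linear.

```python
def and_rep(u, v):
    """Textual {h}-representation of (u AND v); the argument u occurs twice."""
    return f"h({u},h({u},{v}))"

def left_nested(n):
    """((x1 AND x2) AND x3) ... : the growing part is the duplicated argument."""
    expr = "x1"
    for i in range(2, n + 1):
        expr = and_rep(expr, f"x{i}")
    return expr

def right_nested(n):
    """x1 AND (x2 AND (x3 ...)) : the growing part occurs only once."""
    expr = f"x{n}"
    for i in range(n - 1, 0, -1):
        expr = and_rep(f"x{i}", expr)
    return expr

def connectives(expr):
    """Number of occurrences of the connective h in the formula."""
    return expr.count("h")
```

For $n$ variables, the first nesting uses $2^n - 2$ occurrences of $h$ and the second only $2(n - 1)$, which is the kind of structural choice the remark above refers to.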

4 The complexity of Abd(B) 

We commence with the symmetric abduction problem Abd($B$). The results of
this section are summarized in Figure 1. We will first focus on the case where
$\varphi$ is a single positive literal, thus discussing the problem Abd($B$, PQ).

4.1 The complexity of Abd(B, PQ) 

Theorem 4.1. Let $B$ be a finite set of Boolean functions. Then the symmetric
abduction problem for propositional $B$-formulae with a positive literal
manifestation, Abd($B$, PQ), is

1. $\Sigma^p_2$-complete if $\mathsf{S}_{02} \subseteq [B]$ or $\mathsf{S}_{12} \subseteq [B]$ or $\mathsf{D}_1 \subseteq [B]$,

2. NP-complete if $\mathsf{S}_{00} \subseteq [B] \subseteq \mathsf{M}$ or $\mathsf{S}_{10} \subseteq [B] \subseteq \mathsf{M}$ or $\mathsf{D}_2 \subseteq [B] \subseteq \mathsf{M}$,

3. in P and $\oplus$LOGSPACE-hard if $\mathsf{L}_2 \subseteq [B] \subseteq \mathsf{L}$, and

4. in LOGSPACE in all other cases.

Remark 4.2. For such a classification a natural question is: given $B$, how hard
is it to determine the complexity of Abd($B$, PQ)? Solving this task requires
checking whether certain clones are included in $[B]$ (for lower bounds) and
whether $B$ itself is included in certain clones (for upper bounds). As shown in
[Vol09], the complexity of checking whether certain Boolean functions are
included in a clone depends on the representation of the Boolean functions. If
all functions are given by their truth table, then the problem is in
quasi-polynomial-size $\mathrm{AC}^0$, while if the input functions are given in
a compact way, i.e., by circuits, then the above problem becomes coNP-complete.

We split the proof of Theorem 4.1 into several propositions. 

Proposition 4.3. Let $B$ be a finite set of Boolean functions such that
$[B] \subseteq \mathsf{E}$ or $[B] \subseteq \mathsf{N}$ or
$[B] \subseteq \mathsf{V}$. Then Abd($B$, PQ) $\in$ LOGSPACE.

Proof. Let $\mathcal{P} = (\Gamma, A, q)$ be an instance of Abd($B$, PQ).

For $[B] = \mathsf{N}$ or $[B] = \mathsf{E}$, $\Gamma$ is equivalent to a set of
literals, hence $\mathcal{P}$ has the empty set as a solution if $\mathcal{P}$
possesses a solution at all. Finally notice that satisfiability of a set of
$\mathsf{N}$-formulae can be tested in logarithmic space [Sch05].

For $[B] = \mathsf{V}$, each formula $\varphi \in \Gamma$ is equivalent to either
a constant or a disjunction. It holds that $(\Gamma, A, q)$ has a solution if and
only if $\Gamma$ contains a formula
$\varphi \equiv q \vee x_1 \vee \cdots \vee x_k$ such that
$\{x_1, \dots, x_k\} \subseteq A$, and $\Gamma[x_1/0, \dots, x_k/0]$ is
satisfiable. This can be tested in logarithmic space, as substitution of symbols
and evaluation of $\mathsf{V}$-formulae can all be performed in logarithmic
space. □
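
The criterion used for the disjunctive case can be written out directly. The sketch below is illustrative only and makes no attempt to respect the logarithmic space bound. It assumes a representation chosen here, not taken from the paper: every formula is given as a pair consisting of a flag (does the disjunction contain the constant 1?) and the set of its variables; the constant 0 can simply be dropped from a disjunction.

```python
def has_explanation_or_case(gamma, hypotheses, q):
    """Decide the instance (gamma, hypotheses, q) for disjunctive formulae.

    gamma: list of (has_const_one, set_of_variables) pairs, each standing for
    the disjunction of its variables (and of the constant 1 if flagged).
    """
    for has_one, variables in gamma:
        if has_one or q not in variables:
            continue                      # not of the form q OR x_1 OR ... OR x_k
        zeroed = variables - {q}          # the variables x_1, ..., x_k
        if not zeroed <= hypotheses:
            continue
        # After substituting 0 for x_1, ..., x_k, the set stays satisfiable iff
        # no formula loses all of its variables (set the remaining ones to 1).
        if all(one or (ws - zeroed) for one, ws in gamma):
            return True
    return False
```

For the knowledge base $\{q \vee x,\ x \vee y\}$ with hypotheses $\{x\}$ the test succeeds, with explanation $\{\neg x\}$; for $\{q \vee x,\ x\}$ it fails, since $x$ cannot consistently be set to 0.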

Proposition 4.4. Let $B$ be a finite set of Boolean functions such that
$\mathsf{L}_2 \subseteq [B] \subseteq \mathsf{L}$. Then Abd($B$, PQ) is
$\oplus$LOGSPACE-hard and contained in P.

Proof. In this case, deciding whether an instance of Abd($B$, PQ) has a solution
logspace reduces to the problem of deciding whether a propositional abduction
problem in which the knowledge base is a set of linear equations has a solution.
This has been shown to be decidable in polynomial time in [Zan03].

As for the $\oplus$LOGSPACE-hardness, let $B$ be such that
$[B] = \mathsf{L}_2$. Consider the $\oplus$LOGSPACE-complete problem to determine
whether a system of linear equations $S$ over GF(2) has a solution [BDHM92].
Note that $\oplus$LOGSPACE is closed under complement, so deciding whether such
a system has no solution is also $\oplus$LOGSPACE-complete. Let
$S = \{s_1, \dots, s_m\}$ be such a system of linear equations over variables
$\{x_1, \dots, x_n\}$. Then, for all $1 \le i \le m$, the equation $s_i$ is of
the form $x_{i_1} + \cdots + x_{i_{n_i}} = c_i \pmod 2$ with
$c_i \in \{0, 1\}$ and $i_1, \dots, i_{n_i} \in \{1, \dots, n\}$. We map $S$ to
a set of affine formulae $\Gamma = \{\varphi_1, \dots, \varphi_m\}$ over
variables $\{x_1, \dots, x_n, q\}$ via

$\varphi_i := x_{i_1} \oplus \cdots \oplus x_{i_{n_i}} \oplus 1$ if $c_i = 0$, and
$\varphi_i := x_{i_1} \oplus \cdots \oplus x_{i_{n_i}}$ if $c_i = 1$.
Now define

$\Gamma' := \{\varphi_i \oplus q \mid \varphi_i \in \Gamma \text{ is such that } \varphi_i(1, \dots, 1) = 0\}$
$\cup\ \{\varphi_i \mid \varphi_i \in \Gamma \text{ is such that } \varphi_i(1, \dots, 1) = 1\}$.

$\Gamma'$ is obviously satisfied by the assignment mapping all propositions to 1.
It furthermore holds that $S$ has no solution if and only if
$\Gamma' \wedge \neg q$ is unsatisfiable. Hence, we obtain that $S$ has no
solution if and only if the propositional abduction problem
$(\Gamma', \emptyset, q)$ has an explanation.
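
The two facts used in this construction can be checked by brute force on small systems. The following sketch is an illustration under assumptions made here: an equation is a pair of a list of distinct 0-based variable indices and a right-hand side, assignments are tuples of 0/1 values, and all function names are hypothetical.

```python
from itertools import product

def phi(equation, s):
    """phi_i under assignment s: true iff the equation holds modulo 2."""
    indices, c = equation
    return sum(s[i] for i in indices) % 2 == c

def gamma_prime_holds(system, s, q):
    """Do all formulae of Gamma' hold under s and the value q (0 or 1) of q?"""
    for indices, c in system:
        value = phi((indices, c), s)
        at_ones = len(indices) % 2 == c       # phi_i(1, ..., 1)
        # phi_i is kept if phi_i(1, ..., 1) = 1, and becomes phi_i XOR q otherwise.
        if not (value if at_ones else value != bool(q)):
            return False
    return True

def solvable(system, n):
    """Does the system have a solution over GF(2)?"""
    return any(all(phi(eq, s) for eq in system)
               for s in product((0, 1), repeat=n))

def gamma_prime_and_not_q_satisfiable(system, n):
    """Is Gamma' together with NOT q satisfiable?"""
    return any(gamma_prime_holds(system, s, 0)
               for s in product((0, 1), repeat=n))
```

On any system, `gamma_prime_holds` succeeds for the all-ones assignment with $q = 1$, and `solvable` agrees with `gamma_prime_and_not_q_satisfiable`, mirroring the equivalence stated above.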

It remains to transform $\Gamma'$ into a set of $B$-formulae in logarithmic
space. Since $[B] = \mathsf{L}_2$, we have $x \oplus y \oplus z \in [B]$. We
insert parentheses in every formula of $\Gamma'$ in such a way that we get a
ternary $\oplus$-tree of logarithmic depth whose leaves are either a proposition
or the constant 1. Then we replace every node $\oplus$ by its equivalent
$B$-formula. Thus we get a $(B \cup \{1\})$-formula of size polynomial in the
size of the original one. Lemma 3.1 then allows us to conclude. □

Observe that in the cases $[B] \subseteq \mathsf{L}$, $[B] \subseteq \mathsf{E}$,
and $[B] \subseteq \mathsf{V}$, the abduction problem for $B$-formulae is
self-reducible. Roughly speaking, this means that given an instance
$\mathcal{P}$ and a literal $l$, we can efficiently compute an instance
$\mathcal{P}'$ such that the question whether there exists an explanation $E$
with $l \in E$ reduces to the question whether $\mathcal{P}'$ admits solutions.
It is well-known that for self-reducible problems whose decision problem is in
P, the lexicographically first solution can be computed in FP. It is an easy
exercise to extend this algorithm to enumerate all solutions in lexicographic
order with polynomial delay and polynomial space. Thus, if
$[B] \subseteq \mathsf{L}$ or $[B] \subseteq \mathsf{E}$ or
$[B] \subseteq \mathsf{V}$, the explanations of Abd($B$, PQ) can be enumerated
with polynomial delay and polynomial space according to Propositions 4.3 and
4.4.

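Proposition 4.5. Let $B$ be a finite set of Boolean functions such that
$\mathsf{S}_{00} \subseteq [B] \subseteq \mathsf{M}$ or
$\mathsf{S}_{10} \subseteq [B] \subseteq \mathsf{M}$ or
$\mathsf{D}_2 \subseteq [B] \subseteq \mathsf{M}$. Then Abd($B$, PQ) is
NP-complete.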

Proof. We first show that Abd($B$, PQ) is efficiently verifiable. Let
$\mathcal{P} = (\Gamma, A, q)$ be an Abd($B$, PQ)-instance and
$E \subseteq \mathrm{Lits}(A)$ be a candidate for an explanation. Define
$\Gamma'$ as the set of formulae obtained from $\Gamma$ by replacing each
occurrence of the proposition $x$ with 0 if $\neg x \in E$, and each occurrence
of the proposition $x$ with 1 if $x \in E$. It holds that $E$ is a solution for
$\mathcal{P}$ if $\Gamma'$ is satisfiable and $\Gamma'[q/0]$ is not. These tests
can be performed in polynomial time, because $\Gamma'$ is a set of monotonic
formulae [Lew79]. Hence, Abd($B$, PQ) $\in$ NP.
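
The polynomial-time test rests on the fact that a set of monotonic formulae is satisfiable exactly when it is satisfied by setting every free variable to 1. A minimal sketch of the verification step follows; the tuple encoding of formulae over $\wedge$, $\vee$ and constants, as well as the function names, are assumptions made for this illustration.

```python
def evaluate(formula, s):
    """Evaluate a formula given as ("var", name), ("const", 0 or 1),
    ("and", f, g) or ("or", f, g) under the assignment s (values 0/1)."""
    tag = formula[0]
    if tag == "var":
        return s[formula[1]]
    if tag == "const":
        return formula[1]
    left, right = evaluate(formula[1], s), evaluate(formula[2], s)
    return (left and right) if tag == "and" else (left or right)

def verify_monotone(gamma, variables, e, q):
    """Check a candidate E, given as a dict from hypotheses to 0/1.

    Gamma' is Gamma with E substituted; being monotonic, it is satisfiable
    iff it holds when every remaining variable is set to 1.
    """
    top = {v: 1 for v in variables}
    top.update(e)
    satisfiable = all(evaluate(g, top) for g in gamma)
    top_q0 = dict(top)
    top_q0[q] = 0                          # Gamma'[q/0], remaining variables at 1
    entails_q = not all(evaluate(g, top_q0) for g in gamma)
    return satisfiable and entails_q
```

With the single formula $x \vee q$, the candidate $\{\neg x\}$ is accepted while $\{x\}$ is rejected.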

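Next we give a reduction from the NP-complete problem 2-In-3-Sat, i.e.,
the problem to decide whether there exists an assignment that satisfies
exactly two propositions in each clause of a given formula in conjunctive
normal form with exactly three positive propositions per clause, see
[Sch78]. Let $\varphi := \bigwedge_{i \in I} c_i$ with
$c_i = x_{i1} \vee x_{i2} \vee x_{i3}$, $i \in I$, be the given formula. We
map $\varphi$ to the following instance $\mathcal{P} = (\Gamma, A, q)$. Let
$q_i$, $i \in I$, be fresh, pairwise distinct propositions and let
$A := \mathrm{Vars}(\varphi) \cup \{q_i \mid i \in I\}$. The set $\Gamma$ is
defined as

$\Gamma := \{c_i \mid i \in I\}$    (1)
$\cup\ \{x_{i1} \vee x_{i2} \vee q_i,\ x_{i1} \vee x_{i3} \vee q_i,\ x_{i2} \vee x_{i3} \vee q_i \mid i \in I\}$    (2)
$\cup\ \{\bigvee_{i \in I} \bigwedge_{j=1}^{3} x_{ij} \vee \bigvee_{i \in I} q_i \vee q\}$.    (3)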

We show that there is an assignment that sets to true exactly two propositions
in each clause of $\varphi$ if and only if $\mathcal{P}$ has a solution. First,
suppose that there exists an assignment $\sigma$ such that for all $i \in I$,
there is a permutation $\pi_i$ of $\{1, 2, 3\}$ such that
$\sigma(x_{i\pi_i(1)}) = 0$ and
$\sigma(x_{i\pi_i(2)}) = \sigma(x_{i\pi_i(3)}) = 1$. Thus (1) and (2) are
satisfied, and (3) is equivalent to $\bigvee_{i \in I} q_i \vee q$. From this,
it is readily observed that
$\{\neg x \mid \sigma(x) = 0\} \cup \{\neg q_i \mid i \in I\}$ is a solution to
$\mathcal{P}$.

Conversely, suppose that $\mathcal{P}$ has an explanation $E$ that is w.l.o.g.
full. Then $\Gamma \wedge E$ is satisfiable and $\Gamma \wedge E \models q$. Let
$\sigma\colon \mathrm{Vars}(\Gamma) \to \{0, 1\}$ be an assignment that
satisfies $\Gamma \wedge E$. Then, for any $x \in A$, $\sigma(x) = 0$ if
$\neg x \in E$, and $\sigma(x) = 1$ otherwise. Since $\Gamma \wedge E$ entails
$q$ and as the only occurrence of $q$ is in (3), we obtain that $\sigma$ sets to
0 each $q_i$ and at least one proposition in each clause of $\varphi$.
Consequently, from (2) it follows that $\sigma$ sets to 1 at least two
propositions in each clause of $\varphi$. Therefore, $\sigma$ sets to 1 exactly
two propositions in each clause of $\varphi$.

It remains to show that $\mathcal{P}$ can be transformed into an
Abd($B$, PQ)-instance for all considered $B$. Observe that
$\vee \in [B \cup \{1\}]$ and
$[\mathsf{S}_{00} \cup \{0, 1\}] = [\mathsf{D}_2 \cup \{0, 1\}] = [\mathsf{S}_{10} \cup \{0, 1\}] = \mathsf{M}$.
Therefore, due to Lemma 3.1, it suffices to consider the case
$[B] = \mathsf{M}$. Using the associativity of $\vee$, rewrite (3) as an
$\vee$-tree of logarithmic depth and replace all the connectives in $\Gamma$ by
their $B$-representation ($\vee, \wedge \in [B]$). □

Proposition 4.6. Let $B$ be a finite set of Boolean functions such that
$\mathsf{S}_{02} \subseteq [B]$ or $\mathsf{S}_{12} \subseteq [B]$ or
$\mathsf{D}_1 \subseteq [B]$. Then Abd($B$, PQ) is $\Sigma^p_2$-complete.

Proof. Membership in $\Sigma^p_2$ is easily seen to hold: given an instance
$(\Gamma, A, q)$, guess an explanation $E$ and subsequently verify that
$\Gamma \wedge E$ is satisfiable and $\Gamma \wedge E \wedge \neg q$ is not.

Observe that $\vee \in [B \cup \{1\}]$. By virtue of Lemma 3.1 and the fact that
$[\mathsf{S}_{02} \cup \{0, 1\}] = [\mathsf{S}_{12} \cup \{0, 1\}] = [\mathsf{D}_1 \cup \{0, 1\}] = \mathsf{BF}$,
it suffices to consider the case $[B] = \mathsf{BF}$. In [EG95] it has been
shown that the propositional abduction problem remains $\Sigma^p_2$-complete
when the knowledge base $\Gamma$ is a set of clauses. From such an instance
$(\Gamma, A, q)$ we build an instance of Abd($B$, PQ) by first rewriting each
clause as an $\vee$-tree of logarithmic depth and then replacing the occurring
connectives $\vee$ and $\neg$ by their $B$-representation, thus concluding the
proof. □

4.2 Variants of Abd(B) 

We now consider the symmetric abduction problem for different variants on the 
manifestations: clause, term and $B$-formula. Let us first make a remark on the
cases where the manifestation is a (not necessarily positive) literal or a negative 
literal. 
Remark 4.7. Abd($B$, Q) obeys the same classification as Abd($B$, PQ) since all
bounds, upper and lower, easily carry over. For Abd($B$, NQ) the problem becomes
trivial if $[B] \subseteq \mathsf{M}$. For $[B] \subseteq \mathsf{L}$,
Abd($B$, NQ) is solvable in polynomial time according to [Zan03]. For the
remaining clones (i.e., for $\mathsf{S}_{02} \subseteq [B]$,
$\mathsf{S}_{12} \subseteq [B]$, and $\mathsf{D}_1 \subseteq [B]$), we can again
easily adapt the proofs of Abd($B$, PQ). This way we obtain a dichotomous
classification for Abd($B$, NQ) into P-complete and $\Sigma^p_2$-complete cases,
thus skipping the intermediate NP level.

For clauses, it is obvious that Abd($B$, PQ) $\le^{\log}_m$ Abd($B$, PC).
Therefore, all hardness results continue to hold for Abd($B$, PC). It is an easy
exercise to prove that all algorithms that have been developed for a single
query can be naturally extended to clauses. Therefore, the complexity
classifications for the problems Abd($B$, PC), Abd($B$, C) and Abd($B$, NC) are
exactly the same as for Abd($B$, PQ), Abd($B$, Q) and Abd($B$, NQ), respectively.

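Theorem 4.8. Let $B$ be a finite set of Boolean functions. Then the symmetric
abduction problem for propositional $B$-formulae with a positive clause
manifestation, Abd($B$, PC), is

1. $\Sigma^p_2$-complete if $\mathsf{S}_{02} \subseteq [B]$ or $\mathsf{S}_{12} \subseteq [B]$ or $\mathsf{D}_1 \subseteq [B]$,

2. NP-complete if $\mathsf{S}_{00} \subseteq [B] \subseteq \mathsf{M}$ or $\mathsf{S}_{10} \subseteq [B] \subseteq \mathsf{M}$ or $\mathsf{D}_2 \subseteq [B] \subseteq \mathsf{M}$,

3. in P and $\oplus$LOGSPACE-hard if $\mathsf{L}_2 \subseteq [B] \subseteq \mathsf{L}$, and

4. in LOGSPACE in all other cases.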

Notably, we will prove in the next section that allowing for terms as
manifestations increases the complexity for the clones $\mathsf{V}$ (from
membership in LOGSPACE to NP-completeness), while allowing $B$-formulae as
manifestations makes the classification dichotomous again: all problems become
either P- or $\Sigma^p_2$-complete.

The complexity of Abd(B, PT) 

Proposition 4.9. Let $B$ be a finite set of Boolean functions such that
$\mathsf{V}_2 \subseteq [B] \subseteq \mathsf{V}$. Then Abd($B$, PT) is
NP-complete.

Proof. Let $B$ be a finite set of Boolean functions such that
$\mathsf{V}_2 \subseteq [B] \subseteq \mathsf{V}$ and let
$\mathcal{P} = (\Gamma, A, t)$ be an instance of Abd($B$, PT). Hence, $\Gamma$
is a set of $B$-formulae and $t$ is a term, $t = \bigwedge_{i=1}^{n} l_i$.
Observe that $E$ is a solution for $\mathcal{P}$ if $\Gamma \wedge E$ is
satisfiable and for every $i = 1, \dots, n$,
$\Gamma \wedge E \wedge \neg l_i$ is not. Given a set
$E \subseteq \mathrm{Lits}(A)$, these verifications, which require substitution
of symbols and evaluation of a $\mathsf{V}$-formula, can be performed in
polynomial time, thus proving membership in NP.

To prove NP-hardness, we give a reduction from 3Sat. Let $\varphi$ be a
3-CNF-formula, $\varphi := \bigwedge_{i \in I} c_i$. Let $x_1, \dots, x_n$
enumerate the variables occurring in $\varphi$. Let $x'_1, \dots, x'_n$ and
$q_1, \dots, q_n$ be fresh, pairwise distinct variables. We map $\varphi$ to
$\mathcal{P} = (\Gamma, A, t)$, where

$\Gamma := \{c_i[\neg x_1/x'_1, \dots, \neg x_n/x'_n] \mid i \in I\}$
$\cup\ \{x_i \vee x'_i,\ x_i \vee q_i,\ x'_i \vee q_i \mid 1 \le i \le n\}$,
$A := \{x_1, \dots, x_n, x'_1, \dots, x'_n\}$,
$t := q_1 \wedge \cdots \wedge q_n$.

We show that $\varphi$ is satisfiable if and only if $\mathcal{P}$ has a
solution. First assume that $\varphi$ is satisfied by the assignment
$\sigma\colon \{x_1, \dots, x_n\} \to \{0, 1\}$. Define
$E := \{\neg x_i \mid \sigma(x_i) = 0\} \cup \{\neg x'_i \mid \sigma(x_i) = 1\}$
and $\hat\sigma$ as the extension of $\sigma$ mapping
$\hat\sigma(x'_i) = \neg\sigma(x_i)$ and $\hat\sigma(q_i) = 1$ for all
$1 \le i \le n$. Obviously, $\hat\sigma \models \Gamma \wedge E$. Furthermore,
$\Gamma \wedge E \models q_i$ for all $1 \le i \le n$, because any satisfying
assignment of $\Gamma \wedge E$ sets to 0 either $x_i$ or $x'_i$ and thus
$\{x_i \vee q_i,\ x'_i \vee q_i\} \models q_i$. Hence $E$ is an explanation for
$\mathcal{P}$.

Conversely, suppose that $\mathcal{P}$ has a full explanation $E$. The facts
that $\Gamma \wedge E \models q_1 \wedge \cdots \wedge q_n$ and that each $q_i$
occurs only in the clauses $x_i \vee q_i$, $x'_i \vee q_i$ enforce that, for
every $i$, $E$ contains $\neg x_i$ or $\neg x'_i$. Because of the clause
$x_i \vee x'_i$, it cannot contain both. Therefore in $E$ the value of $x'_i$
is determined by the value of $x_i$ and is its dual. From this it is easy to
conclude that the assignment $\sigma\colon \{x_1, \dots, x_n\} \to \{0, 1\}$
defined by $\sigma(x_i) = 0$ if $\neg x_i \in E$, and 1 otherwise, satisfies
$\varphi$. Finally, $\mathcal{P}$ can be transformed into an
Abd($B$, PT)-instance, because every formula in $\Gamma$ is the disjunction of
at most three variables and $\vee \in [B]$. □

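Theorem 4.10. Let $B$ be a finite set of Boolean functions. Then the symmetric
abduction problem for propositional $B$-formulae with a positive term
manifestation, Abd($B$, PT), is

1. $\Sigma^p_2$-complete if $\mathsf{S}_{02} \subseteq [B]$ or $\mathsf{S}_{12} \subseteq [B]$ or $\mathsf{D}_1 \subseteq [B]$,

2. NP-complete if $\mathsf{V}_2 \subseteq [B] \subseteq \mathsf{M}$ or $\mathsf{S}_{10} \subseteq [B] \subseteq \mathsf{M}$ or $\mathsf{D}_2 \subseteq [B] \subseteq \mathsf{M}$,

3. in P and $\oplus$LOGSPACE-hard if $\mathsf{L}_2 \subseteq [B] \subseteq \mathsf{L}$, and

4. in LOGSPACE in all other cases.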

Proof. 1. The $\Sigma^p_2$-hardness follows directly from Proposition 4.6.

2. For the clones $\mathsf{V}_2 \subseteq [B] \subseteq \mathsf{V}$, see Proposition 4.9. In all other clones, the
NP-hardness follows from a straightforward generalization of the proof of 
Proposition 4.5. 

3. Membership in P follows directly from [Zan03, Theorem 67], the
$\oplus$LOGSPACE-hardness from Proposition 4.4.

4. Analogous to Proposition 4.3. □ 

Remark 4.11. All upper and lower bounds for Abd($B$, PT) easily carry over
to Abd($B$, T). It is also easily seen that Abd($B$, NT) is classified exactly
as Abd($B$, NQ), see Remark 4.7.

The complexity of Abd(B, $\mathcal{L}(B)$)
Proposition 4.12. Let $B$ be a finite set of Boolean functions such that
$\mathsf{S}_{00} \subseteq [B]$ or $\mathsf{S}_{10} \subseteq [B]$ or
$\mathsf{D}_2 \subseteq [B]$. Then Abd($B$, $\mathcal{L}(B)$) is
$\Sigma^p_2$-complete.

Proof. We prove $\Sigma^p_2$-hardness by giving a reduction from the
$\Sigma^p_2$-hard problem $\mathrm{QSAT}_2$ [Wra77]. Let an instance of
$\mathrm{QSAT}_2$ be given by a closed formula
$\chi := \exists x_1 \cdots \exists x_n \forall y_1 \cdots \forall y_m\, \varphi$
with $\varphi$ being a 3-DNF-formula. First observe that
$\exists x_1 \cdots \exists x_n \forall y_1 \cdots \forall y_m\, \varphi$ is true
if and only if there exists a consistent set
$X \subseteq \mathrm{Lits}(\{x_1, \dots, x_n\})$ such that
$X \cap \{x_i, \neg x_i\} \ne \emptyset$ for all $1 \le i \le n$, and
$\neg X \vee \varphi$ is (universally) valid (or equivalently
$\neg\varphi \wedge X$ is unsatisfiable).

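Denote by $\overline{\varphi}$ the negation normal form of $\neg\varphi$ and let
$\overline{\varphi}'$ be obtained from $\overline{\varphi}$ by replacing all
occurrences of $\neg x_i$ with a fresh proposition $x'_i$, $1 \le i \le n$, and
all occurrences of $\neg y_i$ with a fresh proposition $y'_i$,
$1 \le i \le m$. That is,
$\overline{\varphi}' \equiv \overline{\varphi}[\neg x_1/x'_1, \dots, \neg x_n/x'_n, \neg y_1/y'_1, \dots, \neg y_m/y'_m]$.
Thus $\overline{\varphi}' = \bigwedge_{i \in I} c'_i$, where every $c'_i$ is a
disjunction of three propositions. To $\chi$ associate the propositional
abduction problem $\mathcal{P} = (\Gamma, A, \psi)$ defined as follows:

$\Gamma := \{c'_i \vee q \mid i \in I\}$
$\cup\ \{x_i \vee x'_i \mid 1 \le i \le n\} \cup \{y_i \vee y'_i \mid 1 \le i \le m\}$
$\cup\ \{f_i \vee x_i,\ t_i \vee x'_i,\ f_i \vee t_i \mid 1 \le i \le n\}$,

$A := \{t_i, f_i \mid 1 \le i \le n\}$,

$\psi := q \vee \bigvee_{1 \le i \le n} (x_i \wedge x'_i) \vee \bigvee_{1 \le i \le m} (y_i \wedge y'_i)$.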

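Suppose that $\chi$ is true. Then there exists an assignment
$\sigma\colon \{x_1, \dots, x_n\} \to \{0, 1\}$ such that no extension
$\sigma'\colon \{x_1, \dots, x_n\} \cup \{y_1, \dots, y_m\} \to \{0, 1\}$ of
$\sigma$ satisfies $\neg\varphi$. Define $X$ as the set of literals over
$\{x_1, \dots, x_n\}$ set to 1 by $\sigma$. Defining
$E := \{\neg f_i, t_i \mid x_i \in X\} \cup \{\neg t_i, f_i \mid \neg x_i \in X\}$,
we obtain with abuse of notation

$\Gamma \wedge E \wedge \neg\psi \equiv \bigwedge_{i \in I} c'_i \wedge \bigwedge_{1 \le i \le n} (x_i \oplus x'_i) \wedge \bigwedge_{1 \le i \le m} (y_i \oplus y'_i) \wedge \bigwedge_{1 \le i \le n,\ \sigma(x_i) = 1} x_i \wedge \bigwedge_{1 \le i \le n,\ \sigma(x_i) = 0} x'_i$
$\equiv \neg\varphi \wedge X$,

which is unsatisfiable by assumption. As $\Gamma \wedge E$ is satisfied by any
assignment setting in addition all $x_i, x'_i$, $1 \le i \le n$, and all
$y_i, y'_i$, $1 \le i \le m$, to 1, we have proved that $E$ is an explanation
for $\mathcal{P}$.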

Conversely, suppose that $\mathcal{P}$ has an explanation $E$. Assume w.l.o.g.
that $E$ is full. Due to the clause $(f_i \vee t_i)$ in $\Gamma$, we also may
assume that $|E \cap \{\neg t_i, \neg f_i\}| \le 1$ for all $1 \le i \le n$.

Setting $X := \{x_i \mid \neg f_i \in E\} \cup \{\neg x_i \mid \neg t_i \in E\}$
we now obtain
$\bigwedge_{1 \le i \le n} ((f_i \vee x_i) \wedge (t_i \vee \neg x_i) \wedge (f_i \vee t_i)) \wedge E \equiv X$
and $\Gamma \wedge E \wedge \neg\psi \equiv \neg\varphi \wedge X$ as above.
Hence, $\neg\varphi \wedge X$ is unsatisfiable, which implies the existence of
an assignment $\sigma\colon \{x_1, \dots, x_n\} \to \{0, 1\}$ such that no
extension
$\sigma'\colon \{x_1, \dots, x_n\} \cup \{y_1, \dots, y_m\} \to \{0, 1\}$ of
$\sigma$ satisfies $\neg\varphi$. Therefore, we have proved that $\chi$ is true
if and only if $\mathcal{P}$ has an explanation.

It remains to show that $\mathcal{P}$ can be transformed into an ABD($B$, $\mathcal{L}(B)$)-instance
for any relevant $B$. Since $[\mathsf{S}_{00} \cup \{1\}] = \mathsf{S}_{01}$,
$[\mathsf{S}_{10} \cup \{1\}] = \mathsf{M}_1$, $[\mathsf{D}_2 \cup \{1\}] = \mathsf{S}^2_{01}$ and
$\mathsf{S}_{01} \subseteq \mathsf{S}^2_{01} \subseteq \mathsf{M}_1$, it suffices to consider the case
$[B] = \mathsf{S}_{01}$ by Lemma 3.1. Observe that $x \vee (y \wedge z) \in [B]$. The
transformation can hence be done in polynomial time by local replacements,
rewriting $\psi$ as
$\bigvee_{1 \le i \le n} (q \vee (x_i \wedge x'_i)) \vee \bigvee_{1 \le i \le m} (q \vee (y_i \wedge y'_i))$
and using the associativity of $\vee$. □

Theorem 4.13. Let $B$ be a finite set of Boolean functions. Then the symmetric
abduction problem for propositional $B$-formulae with a $B$-formula manifestation,
ABD($B$, $\mathcal{L}(B)$), is

1. $\Sigma^p_2$-complete if $\mathsf{S}_{00} \subseteq [B]$ or $\mathsf{S}_{10} \subseteq [B]$ or $\mathsf{D}_2 \subseteq [B]$,

2. in P and $\oplus$LOGSPACE-hard if $\mathsf{L}_2 \subseteq [B] \subseteq \mathsf{L}$, and

3. in LOGSPACE in all other cases.

Proof. 1. See Proposition 4.12. 

2. Membership in P follows directly from [Zan03]; the $\oplus$LOGSPACE-hardness
follows from Proposition 4.4.

3. Analogous to Proposition 4.3. □ 

Observe that there are no sets $B$ of Boolean functions for which
ABD($B$, $\mathcal{L}(B)$) is NP-complete.



5 The complexity of P-Abd(B) 

We will now study the complexity of positive abduction, in which an explanation
consists of a set of positive literals. The results of this section are summarized
in Figure 2. To begin with, note that for monotonic or 1-reproducing sets of
formulae, deciding the existence of a positive explanation reduces to testing
whether $A$ is one.

Lemma 5.1. For $[B] \subseteq \mathsf{R}_1$ or $[B] \subseteq \mathsf{M}$, an instance
$(\Gamma, A, \varphi)$ of P-ABD($B$, $\mathcal{M}$) has solutions if and only if $A$ is a solution.

Proof. Let $E \subseteq A$ be an arbitrary explanation for the given instance
$(\Gamma, A, \varphi)$. One easily verifies that $\Gamma \wedge A$ remains satisfiable if
$\Gamma \wedge E$ was, and $\Gamma \wedge A \wedge \neg\varphi$ remains unsatisfiable if
$\Gamma \wedge E \wedge \neg\varphi$ was. Conversely, if $A$ is not an explanation,
no proper subset $E \subsetneq A$ can be an explanation either. □
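
The statement of Lemma 5.1 can be checked by exhaustive search on small instances. The following is a minimal sketch, not part of the paper: formulae are modelled as Python predicates over an assignment dictionary, and the knowledge base `GAMMA`, the variable list `VARIABLES`, and all function names are hypothetical choices made for illustration. The search is exponential and serves only to illustrate the equivalence.

```python
from itertools import combinations, product


def satisfiable(constraints, variables):
    """Brute-force test: does some assignment satisfy every constraint?"""
    for bits in product([False, True], repeat=len(variables)):
        v = dict(zip(variables, bits))
        if all(c(v) for c in constraints):
            return True
    return False


def is_positive_explanation(gamma, E, manifestation, variables):
    """E explains the manifestation iff gamma AND E is satisfiable and
    gamma AND E AND NOT manifestation is unsatisfiable."""
    fixed = [lambda v, x=x: v[x] for x in E]  # each hypothesis in E is set to 1
    negated = lambda v: not manifestation(v)
    return (satisfiable(gamma + fixed, variables)
            and not satisfiable(gamma + fixed + [negated], variables))


def has_positive_explanation(gamma, A, manifestation, variables):
    """Try every subset E of the hypotheses A."""
    return any(
        is_positive_explanation(gamma, set(E), manifestation, variables)
        for r in range(len(A) + 1)
        for E in combinations(A, r)
    )


VARIABLES = ["a", "b", "c", "q"]
# Hypothetical 1-reproducing knowledge base: (a -> q) and (b or c).
GAMMA = [lambda v: (not v["a"]) or v["q"], lambda v: v["b"] or v["c"]]
Q = lambda v: v["q"]
```

For this knowledge base, `has_positive_explanation(GAMMA, ["a", "b"], Q, VARIABLES)` and `is_positive_explanation(GAMMA, {"a", "b"}, Q, VARIABLES)` agree, and they also agree for the hypothesis set `["b"]`, where no explanation exists.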

As a consequence we will see that some of the formerly NP-complete cases
become tractable and that some of the formerly $\Sigma^p_2$-complete cases become
coNP-complete.

5.1 The complexity of P-Abd(B, PQ) 

Proposition 5.2. Let $[B] \subseteq \mathsf{M}$. Then P-ABD($B$, PQ) $\in$ LOGSPACE.

Proof. According to Lemma 5.1 it suffices to test if $A$ is a solution, that is, to
test if $\Gamma \wedge A \wedge \neg q$ or, equivalently, $\Gamma \wedge A[q/0]$ is unsatisfiable.
This can be done in logarithmic space, since $\Gamma \wedge A$ is a monotonic formula [Lew79]. □
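
The test in this proof amounts to two evaluations, because a monotonic formula is satisfiable exactly when the all-ones assignment satisfies it. The sketch below illustrates this under stated assumptions: formulae are encoded as nested tuples over "and" and "or" (an encoding chosen here, not taken from the paper), and `evaluate`, `variables_of`, and `hypotheses_explain` are hypothetical names. It makes no attempt to respect the logarithmic space bound.

```python
def evaluate(formula, assignment):
    """Evaluate a monotone formula: a bool constant, a variable name (str),
    or a tuple ("and" | "or", subformula, ...)."""
    if isinstance(formula, bool):
        return formula
    if isinstance(formula, str):
        return assignment[formula]
    op, *args = formula
    values = [evaluate(arg, assignment) for arg in args]
    if op == "and":
        return all(values)
    if op == "or":
        return any(values)
    raise ValueError(f"not a monotone connective: {op!r}")


def variables_of(formula):
    """Collect the variable names occurring in a formula."""
    if isinstance(formula, bool):
        return set()
    if isinstance(formula, str):
        return {formula}
    return set().union(*(variables_of(arg) for arg in formula[1:]))


def hypotheses_explain(gamma, hypotheses, q):
    """Decide whether the full hypothesis set explains the positive literal q,
    assuming every formula in gamma is monotone."""
    variables = set(hypotheses) | {q}
    for formula in gamma:
        variables |= variables_of(formula)
    all_ones = {x: True for x in variables}
    # Gamma AND A is satisfiable iff the all-ones assignment satisfies Gamma.
    consistent = all(evaluate(f, all_ones) for f in gamma)
    # Gamma AND A AND NOT q is unsatisfiable iff q is a hypothesis or the
    # largest assignment with q = 0 falsifies Gamma.
    q_false = dict(all_ones)
    q_false[q] = False
    entailed = q in hypotheses or not all(evaluate(f, q_false) for f in gamma)
    return consistent and entailed
```

For example, with the knowledge base `[("and", "q", ("or", "a", "c")), ("or", ("and", "a", "b"), "c")]` and hypotheses `{"a", "b"}`, the hypothesis set explains `q`, whereas for `[("or", "a", "q")]` with hypotheses `{"a"}` it does not.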
Proposition 5.3. Let $\mathsf{S}_{02} \subseteq [B] \subseteq \mathsf{R}_1$ or
$\mathsf{S}_{12} \subseteq [B] \subseteq \mathsf{R}_1$ or $\mathsf{D}_1 \subseteq [B] \subseteq \mathsf{R}_1$.
Then P-ABD($B$, PQ) is coNP-complete.

Proof. According to Lemma 5.1 it suffices to test whether $A$ is a solution. Since
$\Gamma \wedge A$ is always satisfiable, only the task of testing whether
$\Gamma \wedge A \wedge \neg q$ is unsatisfiable remains. And this can be done in coNP.

Since $[\mathsf{D}_1 \cup \{1\}] = [\mathsf{S}_{12} \cup \{1\}] = \mathsf{R}_1$,
$[\mathsf{S}_{02} \cup \{1\}] = \mathsf{S}_0$ and $\mathsf{S}_0 \subseteq \mathsf{R}_1$, it
suffices to show hardness for the case $\mathsf{S}_0 \subseteq [B]$ by Lemma 3.1. To show
coNP-hardness we give a reduction from the complement of 3SAT. Let
$\varphi = \bigwedge_{i \in I} c_i$ be a 3-CNF-formula, i.e., each clause $c_i$ consists of
the disjunction of exactly three literals. Since $[\{\to, 0\}] = \mathsf{BF}$, each clause
$c_i$ has a representation as a $\{\to, 0\}$-formula, which we denote by $c'_i$. Let $q$
be a fresh proposition. We map $\varphi$ to $(\Gamma, \emptyset, q)$, where we define
$\Gamma = \bigcup_{i \in I} \{c'_i[0/q]\}$. Note that $\Gamma$ is a set of
$\mathsf{S}_0$-formulae of polynomial size and 1-reproducing. Let $\varphi$ be
unsatisfiable. Then $\Gamma$ is satisfied by the assignment setting all propositions
to 1, and $\Gamma \wedge \neg q$ is unsatisfiable, because it is equivalent to
$\varphi \wedge \neg q$. Summing up, $\emptyset$ is an explanation for
$(\Gamma, \emptyset, q)$. Conversely, let $\varphi$ be satisfiable. This implies that
$\Gamma \wedge \neg q$ is satisfiable and thus $(\Gamma, \emptyset, q)$ has no explanations.

It remains to transform $(\Gamma, \emptyset, q)$ into a P-ABD($B$, PQ)-instance for any
relevant $B$. As $\to\ \in \mathsf{S}_0 \subseteq [B]$, this is done by replacing in $\Gamma$
every occurrence of $\to$ by its $B$-representation. □

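Theorem 5.4. Let $B$ be a finite set of Boolean functions. Then the positive
abduction problem for propositional $B$-formulae with a positive literal
manifestation, P-ABD($B$, PQ), is

1. $\Sigma^p_2$-complete if $\mathsf{D} \subseteq [B]$ or $\mathsf{S}_1 \subseteq [B]$,

2. coNP-complete if $\mathsf{S}_{02} \subseteq [B] \subseteq \mathsf{R}_1$ or $\mathsf{S}_{12} \subseteq [B] \subseteq \mathsf{R}_1$ or $\mathsf{D}_1 \subseteq [B] \subseteq \mathsf{R}_1$,

3. in P and $\oplus$LOGSPACE-hard if $\mathsf{L}_2 \subseteq [B] \subseteq \mathsf{L}$,

4. in LOGSPACE in all other cases.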

Proof. 1. In [NZ08], Nordh and Zanuttini prove that the abduction problem
in which the knowledge base is a set of clauses remains $\Sigma^p_2$-hard even if
explanations are required to comprise positive literals only. A reduction from
this problem can be done analogously to the one in Proposition 4.6.

2. See Proposition 5.3. 

3. Membership in P follows from [NZ08, Theorem 66]. For the
$\oplus$LOGSPACE-hardness the same reduction as in Proposition 4.4 works.

4. See Proposition 5.2. □ 

5.2 Variants of P-Abd(B) 

Having examined the complexity of positive abduction for positive literal
manifestations, we will now consider positive abduction for manifestations that
are restricted to be respectively a clause, a term, or a $B$-formula. But first let us
make a remark on the complexity of positive abduction when the manifestation
is a (not necessarily positive) literal or a negative literal.

Remark 5.5. Again all upper and lower bounds for P-ABD($B$, PQ) easily carry
over to P-ABD($B$, Q). For P-ABD($B$, NQ) the problem becomes trivial if
$[B] \subseteq \mathsf{R}_1$ (Lemma 5.1). For $\mathsf{L}_0 \subseteq [B] \subseteq \mathsf{L}$ and
$\mathsf{L}_3 \subseteq [B] \subseteq \mathsf{L}$, we obtain membership in P from
[NZ08, Theorem 66]. For all remaining cases (i.e., for $\mathsf{D} \subseteq [B]$ and
$\mathsf{S}_1 \subseteq [B]$), we obtain $\Sigma^p_2$-completeness from an easy adaptation of
the first part of the proof of Theorem 5.4.

Analogously to the symmetric case the algorithms can be extended to clauses.
Thus, P-ABD($B$, PC) is classified as P-ABD($B$, PQ). Similarly the
classifications for P-ABD($B$, C) and P-ABD($B$, NC) are the same as the
classifications for P-ABD($B$, Q) and P-ABD($B$, NQ).

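Theorem 5.6. Let $B$ be a finite set of Boolean functions. Then the positive
abduction problem for propositional $B$-formulae with a positive clause
manifestation, P-ABD($B$, PC), is

1. $\Sigma^p_2$-complete if $\mathsf{D} \subseteq [B]$ or $\mathsf{S}_1 \subseteq [B]$,

2. coNP-complete if $\mathsf{S}_{02} \subseteq [B] \subseteq \mathsf{R}_1$ or $\mathsf{S}_{12} \subseteq [B] \subseteq \mathsf{R}_1$ or $\mathsf{D}_1 \subseteq [B] \subseteq \mathsf{R}_1$,

3. in P and $\oplus$LOGSPACE-hard if $\mathsf{L}_2 \subseteq [B] \subseteq \mathsf{L}$,

4. in LOGSPACE in all other cases.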



The Complexity of P-ABD($B$, PT). The classification for positive terms is
identical to the one for a single positive literal, except for the affine clones
$\mathsf{L}_0$, $\mathsf{L}_3$, and $\mathsf{L}$. For these, the complexity of P-ABD($B$, PT)
jumps from membership in P to NP-completeness.

Proposition 5.7. Let $\mathsf{L}_0 \subseteq [B] \subseteq \mathsf{L}$ or
$\mathsf{L}_3 \subseteq [B] \subseteq \mathsf{L}$. Then P-ABD($B$, PT) is NP-complete.

Proof. Let $(\Gamma, A, t)$ with $t = \bigwedge_{i \in I} x_i$ be an instance of
P-ABD($B$, PT). To check whether a given $E \subseteq A$ is an explanation, we have to
test the satisfiability of $\Gamma \wedge E$ and the unsatisfiability of
$\Gamma \wedge E \wedge \neg x_i$ for every $i \in I$. These tasks are equivalent
to solving systems of linear equations, which can be done in polynomial time.
The hardness follows directly from [NZ08, Theorem 70]. □
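
The verification step of this proof can be sketched with Gaussian elimination over GF(2). The code below is an illustration under assumptions, not the authors' procedure: each affine formula is assumed to have already been rewritten as a linear equation `(variables, rhs)` stating that the XOR of the listed variables equals `rhs`, and the names `gf2_consistent` and `is_explanation` are hypothetical.

```python
def gf2_consistent(equations):
    """Decide solvability of a linear system over GF(2).

    Each equation is (mask, rhs): the XOR of the variables whose bits are
    set in mask must equal rhs (0 or 1)."""
    pivots = {}  # highest set bit -> reduced row (mask, rhs)
    for mask, rhs in equations:
        # Eliminate existing pivots from high to low so that a reduction
        # never reintroduces a pivot bit that was already handled.
        for bit in sorted(pivots, reverse=True):
            if (mask >> bit) & 1:
                pmask, prhs = pivots[bit]
                mask ^= pmask
                rhs ^= prhs
        if mask == 0:
            if rhs == 1:
                return False  # the row reduced to 0 = 1
            continue
        pivots[mask.bit_length() - 1] = (mask, rhs)
    return True


def is_explanation(equations, E, term, variables):
    """Check a candidate E for the positive term manifestation `term`."""
    bit = {x: i for i, x in enumerate(variables)}

    def encode(vs, rhs):
        mask = 0
        for x in vs:
            mask ^= 1 << bit[x]
        return (mask, rhs)

    base = [encode(vs, rhs) for vs, rhs in equations]
    base += [encode([x], 1) for x in E]  # hypotheses in E are set to 1
    if not gf2_consistent(base):
        return False  # Gamma AND E is unsatisfiable
    # Gamma AND E AND NOT x_i must be unsatisfiable for every x_i in the term.
    return all(not gf2_consistent(base + [encode([x], 0)]) for x in term)
```

With the hypothetical system `[(["a", "q"], 0), (["b", "c"], 1)]` over the variables `["a", "b", "c", "q"]`, the candidate `{"a"}` is an explanation for the term `["q"]`, while `{"b"}` is not. Guessing the candidate `E` is the nondeterministic part; only its verification is polynomial.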

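Theorem 5.8. Let $B$ be a finite set of Boolean functions. Then the positive
abduction problem for propositional $B$-formulae with a positive term
manifestation, P-ABD($B$, PT), is

1. $\Sigma^p_2$-complete if $\mathsf{D} \subseteq [B]$ or $\mathsf{S}_1 \subseteq [B]$,

2. coNP-complete if $\mathsf{S}_{02} \subseteq [B] \subseteq \mathsf{R}_1$ or $\mathsf{S}_{12} \subseteq [B] \subseteq \mathsf{R}_1$ or $\mathsf{D}_1 \subseteq [B] \subseteq \mathsf{R}_1$,

3. NP-complete if $[B] \in \{\mathsf{L}, \mathsf{L}_0, \mathsf{L}_3\}$,

4. in P and $\oplus$LOGSPACE-hard if $[B] \in \{\mathsf{L}_1, \mathsf{L}_2\}$,

5. in LOGSPACE in all other cases.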

Proof. 1. Follows from the first item of Theorem 5.4.

2. Both membership and hardness follow from Proposition 5.3.

3. See Proposition 5.7.

4. For the $\oplus$LOGSPACE-hardness the same reduction as in Proposition 4.4 works.
Since $\mathsf{L}_1 \subseteq \mathsf{R}_1$, according to Lemma 5.1, it suffices to check
whether $A$ is a solution. This task reduces to solving systems of linear
equations, which is in P.

5. Analogously to Proposition 5.2. □

Remark 5.9. Again all upper and lower bounds for P-ABD($B$, PT) carry over
to P-ABD($B$, T). For P-ABD($B$, NT) the problem becomes trivial if
$[B] \subseteq \mathsf{R}_1$ (Lemma 5.1). For $[B] \in \{\mathsf{L}, \mathsf{L}_0, \mathsf{L}_3\}$, we
obtain NP-completeness, with the hardness being obtained from an easy reduction
from P-ABD($B$, PT): as we have $x \oplus y \in [B \cup \{1\}]$, we can simply
transform the given positive term into a negative one. For the remaining cases
(i.e., $\mathsf{D} \subseteq [B]$ and $\mathsf{S}_1 \subseteq [B]$), we obtain
$\Sigma^p_2$-completeness from an adaptation of the first part of the proof of
Theorem 5.4.

The Complexity of P-ABD($B$, $\mathcal{L}(B)$). The complexity of
P-ABD($B$, $\mathcal{L}(B)$) differs from the complexity of P-ABD($B$, PQ) for the clones
either (a) above $\mathsf{E}$ or $\mathsf{V}$ and below $\mathsf{M}$ or (b) above $\mathsf{L}_0$ or
$\mathsf{L}_3$ and below $\mathsf{L}$. For the former the complexity increases to
coNP-completeness, whereas for the latter we obtain membership in NP and
hardness for $\oplus$LOGSPACE; the exact complexity of positive abduction
when both the knowledge base and the manifestation are represented by
non-1-reproducing affine formulae remains an open problem.

Proposition 5.10. Let $B$ be a finite set of Boolean functions such that
$\mathsf{L}_0 \subseteq [B] \subseteq \mathsf{L}$ or $\mathsf{L}_3 \subseteq [B] \subseteq \mathsf{L}$.
Then P-ABD($B$, $\mathcal{L}(B)$) $\in$ NP.

Proof. Let $E \subseteq A$ be a potential solution. The test for satisfiability of
$\Gamma \wedge E$ and the test for unsatisfiability of $\Gamma \wedge E \wedge \neg\varphi$
are equivalent to solving two systems of linear equations, which can be done in
polynomial time. □

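Proposition 5.11. Let $B$ be a finite set of Boolean functions such that
$\mathsf{S}_{00} \subseteq [B] \subseteq \mathsf{M}$ or $\mathsf{S}_{10} \subseteq [B] \subseteq \mathsf{M}$ or
$\mathsf{D}_2 \subseteq [B] \subseteq \mathsf{M}$. Then P-ABD($B$, $\mathcal{L}(B)$) is coNP-complete.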

Proof. We will first prove coNP-membership. According to Lemma 5.1, it suffices
to test whether $A$ is a solution. This can be done by first verifying that
$\Gamma \wedge A$ is satisfiable, and afterwards verifying that
$\Gamma \wedge A \wedge \neg\psi$ is unsatisfiable. As $\Gamma$ is a set of monotonic
formulae, deciding the satisfiability of $\Gamma \wedge A$ can be done in
logarithmic space; and deciding whether $\Gamma \wedge A \wedge \neg\psi$ is
unsatisfiable is in coNP.

To establish coNP-hardness, we give a reduction from the coNP-hard problem
of deciding whether a given 3-DNF-formula $\varphi$ is a tautology. Let
$\mathrm{Vars}(\varphi) = \{x_1, \ldots, x_n\}$. Denote by $\overline{\varphi}$ the negation
normal form of $\neg\varphi$ and let $\overline{\varphi}'$ be obtained from
$\overline{\varphi}$ by replacing all occurrences of $\neg x_i$ with a fresh
proposition $x'_i$, $1 \le i \le n$. That is,
$\overline{\varphi}' \equiv \overline{\varphi}[\neg x_1/x'_1, \ldots, \neg x_n/x'_n]$. Thus
$\overline{\varphi}' = \bigwedge_{i \in I} c'_i$, where every $c'_i$ is a disjunction of
three propositions. To $\varphi$ we associate the propositional abduction
problem $\mathcal{P} = (\Gamma, \emptyset, \psi)$ defined as follows:

$\Gamma := \{c'_i \vee q \mid i \in I\} \cup \{x_i \vee x'_i \mid 1 \le i \le n\}$,

$\psi := q \vee \bigvee_{i=1}^{n} (x_i \wedge x'_i)$.

Observe that

$\Gamma \wedge \neg\psi \equiv \overline{\varphi}' \wedge \neg q \wedge \bigwedge_{i=1}^{n} (x_i \oplus x'_i)$.    (4)

Suppose that $\varphi$ is a tautology, i.e., $\neg\varphi$ is unsatisfiable. From (4)
it follows that $\Gamma \wedge \neg\psi$ is unsatisfiable. As $\Gamma$ is satisfiable,
$\emptyset$ is a solution for $\mathcal{P}$.

Suppose conversely that $\emptyset$ is a solution for $\mathcal{P}$. Then
$\Gamma \wedge \neg\psi \equiv \overline{\varphi}' \wedge \neg q \wedge \bigwedge_{i=1}^{n} (x_i \oplus x'_i)$
is unsatisfiable. Since $q$ and the $x'_i$ do not occur in $\varphi$, we obtain
the unsatisfiability of $\neg\varphi$. Hence, $\varphi$ is a tautology.

The transformation of $\mathcal{P}$ into a P-ABD($B$, $\mathcal{L}(B)$)-instance for any
relevant $B$ can be done in exactly the same way as in the proof of
Proposition 4.12. □
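
The reduction in this proof can be sanity-checked by brute force on small inputs. The sketch below is illustrative only and rests on assumptions: the manifestation is taken to be $\psi := q \vee \bigvee_{i=1}^{n} (x_i \wedge x'_i)$, the form consistent with equation (4); a DNF is encoded as a list of terms of signed integers (`3` for $x_3$, `-3` for $\neg x_3$); terms are not required to have exactly three literals; and the function names are hypothetical.

```python
from itertools import product


def is_tautology(dnf, n):
    """Brute-force tautology test for a DNF over x_1..x_n."""
    return all(
        any(all(bits[abs(l) - 1] == (l > 0) for l in term) for term in dnf)
        for bits in product([False, True], repeat=n)
    )


def empty_set_explains(dnf, n):
    """Build (Gamma, {}, psi) from the DNF and test whether the empty set is
    an explanation. Variable indices: x_i -> i-1, x'_i -> n+i-1, q -> 2n."""
    q = 2 * n

    def prop(l):
        # A literal l of the DNF appears negated in NNF(not phi);
        # a negated variable NOT x_i is then replaced by x'_i.
        return abs(l) - 1 + n if l > 0 else abs(l) - 1

    clauses = [[prop(l) for l in term] + [q] for term in dnf]  # c'_i OR q
    clauses += [[i, n + i] for i in range(n)]                  # x_i OR x'_i

    def gamma(v):
        return all(any(v[p] for p in clause) for clause in clauses)

    def psi(v):
        return v[q] or any(v[i] and v[n + i] for i in range(n))

    assignments = list(product([False, True], repeat=2 * n + 1))
    consistent = any(gamma(v) for v in assignments)
    entailed = not any(gamma(v) and not psi(v) for v in assignments)
    return consistent and entailed
```

On every input, `empty_set_explains(dnf, n)` should agree with `is_tautology(dnf, n)`; for instance both hold for `[[1, 2], [-1], [-2]]` with `n = 2`, and both fail for `[[1, 2]]`.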

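Theorem 5.12. Let $B$ be a finite set of Boolean functions. Then the positive
abduction problem for propositional $B$-formulae with a $B$-formula manifestation,
P-ABD($B$, $\mathcal{L}(B)$), is

1. $\Sigma^p_2$-complete if $\mathsf{D} \subseteq [B]$ or $\mathsf{S}_1 \subseteq [B]$,

2. coNP-complete if $\mathsf{S}_{02} \subseteq [B] \subseteq \mathsf{R}_1$ or $\mathsf{S}_{12} \subseteq [B] \subseteq \mathsf{R}_1$ or $\mathsf{D}_1 \subseteq [B] \subseteq \mathsf{R}_1$ or
$\mathsf{S}_{00} \subseteq [B] \subseteq \mathsf{M}$ or $\mathsf{S}_{10} \subseteq [B] \subseteq \mathsf{M}$ or $\mathsf{D}_2 \subseteq [B] \subseteq \mathsf{M}$,

3. in NP and $\oplus$LOGSPACE-hard if $[B] \in \{\mathsf{L}, \mathsf{L}_0, \mathsf{L}_3\}$,

4. in P and $\oplus$LOGSPACE-hard if $[B] \in \{\mathsf{L}_1, \mathsf{L}_2\}$,

5. in LOGSPACE in all other cases.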

Proof. 1. Follows from the first item of Theorem 5.4.

2. See Propositions 5.11 and 5.3.

3. For membership in NP, see Proposition 5.10. The $\oplus$LOGSPACE-hardness, on
the other hand, is established using the same reduction as in the proof of
Proposition 4.4.

4. For membership in P, see the fourth item of the proof of Theorem 5.8. The
$\oplus$LOGSPACE-hardness follows from Proposition 4.4 as above.

5. For $[B] \subseteq \mathsf{V}$, a $B$-formula is a positive clause. Thus the result follows
from Theorem 5.6. For $[B] \subseteq \mathsf{N}$ and $[B] \subseteq \mathsf{E}$, see Proposition 4.3. □



6 Overview of Results 

The following two tables give an overview of the results for the studied symmetric 
and positive abduction problems. The small numbers on the right side in the 
table cells refer to the corresponding theorem/proposition/remark. The number 
is omitted for trivial results. 
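Manifestation   | $\mathsf{E}_*$ | $\mathsf{N}_*$ | $\mathsf{V}_*$ | $\mathsf{L}_*$ | $\mathsf{D}_2, \mathsf{S}_{*0} \subseteq [B] \subseteq \mathsf{M}$ | $\mathsf{D}_1, \mathsf{S}_{*2} \subseteq [B] \subseteq \mathsf{BF}$
NQ, NC, NT      | $\in$ L 4.7    | $\in$ L 4.7    | $\in$ L 4.7    | $\in$ P 4.4    | $\in$ L 4.7          | $\Sigma^p_2$-c. 4.7
PQ, PC, Q, C    | $\in$ L 4.3    | $\in$ L 4.3    | $\in$ L 4.3    | $\in$ P 4.4    | NP-c. 4.5            | $\Sigma^p_2$-c. 4.6
PT, T           | $\in$ L 4.10   | $\in$ L 4.10   | NP-c. 4.9      | $\in$ P 4.10   | NP-c. 4.10           | $\Sigma^p_2$-c. 4.10
$\mathcal{L}(B)$ | $\in$ L 4.13   | $\in$ L 4.13   | $\in$ L 4.13   | $\in$ P 4.13   | $\Sigma^p_2$-c. 4.12 | $\Sigma^p_2$-c. 4.12

Table 2. The complexity of ABD, where *-subscripts on clones denote all valid
completions, L abbreviates LOGSPACE, and the suffix "-c." indicates
completeness for the respective complexity class.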



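Manifestation    | $\mathsf{V}_*$ | $\mathsf{L}_1, \mathsf{L}_2$ | $\mathsf{L}_0, \mathsf{L}_3, \mathsf{L}$ | $\mathsf{D}_2, \mathsf{S}_{*0} \subseteq [B] \subseteq \mathsf{M}$ | $\mathsf{D}_1, \mathsf{S}_{*2} \subseteq [B] \subseteq \mathsf{R}_1$ | $\mathsf{D}, \mathsf{S}_1 \subseteq [B] \subseteq \mathsf{BF}$
NQ, NC           | $\in$ L | $\in$ L      | $\in$ P 5.5   | $\in$ L      | $\in$ L      | $\Sigma^p_2$-c. 5.5
NT               | $\in$ L | $\in$ L      | NP-c. 5.9     | $\in$ L      | $\in$ L      | $\Sigma^p_2$-c. 5.9
PQ, PC, Q, C     | $\in$ L | $\in$ P 5.4  | $\in$ P 5.4   | $\in$ L 5.2  | coNP-c. 5.3  | $\Sigma^p_2$-c. 5.4
PT, T            | $\in$ L | $\in$ P 5.8  | NP-c. 5.7     | $\in$ L 5.8  | coNP-c. 5.8  | $\Sigma^p_2$-c. 5.8
$\mathcal{L}(B)$ | $\in$ L | $\in$ P 5.12 | $\in$ NP 5.10 | coNP-c. 5.11 | coNP-c. 5.12 | $\Sigma^p_2$-c. 5.12

Table 3. The complexity of P-ABD, where *-subscripts on clones denote all
valid completions, L abbreviates LOGSPACE, and the suffix "-c." indicates
completeness for the respective complexity class.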



Our results show, for instance, that when the knowledge base's formulae are 
restricted to be represented as positive clauses (i.e., [B] = V), then the abduction 
problem for single literal manifestations is very easy (solvable in Logspace); 
this still holds if the manifestations are represented by positive clauses. But 
its complexity jumps to NP-completeness if we change the restriction on the 
manifestations to allow for positive terms. 

Considering the case that all monotonic functions can be simulated (i.e.,
$X \subseteq [B] \subseteq \mathsf{M}$ for $X \in \{\mathsf{D}_2, \mathsf{S}_{00}, \mathsf{S}_{10}\}$), the
abduction problem is NP-complete for manifestations represented by literals,
clauses, or terms. Here, allowing manifestations represented by monotonic
formulae causes the jump to $\Sigma^p_2$-completeness.
This increase in the complexity of the problem can be intuitively explained as fol- 
lows. The complexity of the abduction rests on two sources: finding a candidate 
explanation and checking that it is indeed a solution. The NP-complete cases 
that occur in our classification hold for problems in which the verification can 
be performed in polynomial time. If both the knowledge base and the manifesta- 
tion are represented by monotonic formulae, verifying a candidate explanation 
is coNP-complete. 

It comes as no surprise that the complexity of P-ABD($B$, $\mathcal{M}$) is lower than or
equal to the complexity of ABD($B$, $\mathcal{M}$) in most cases (except for the affine
clones). We have seen in Lemma 5.1 that for monotonic or 1-reproducing knowledge
bases only one candidate needs to be considered. In these cases the complexity
of the abduction problem is determined by the verification of the candidate.
This explains the appearance of coNP-complete cases in our classification. For
the affine clones (i.e., $[B] \in \{\mathsf{L}, \mathsf{L}_0, \mathsf{L}_3\}$), on the other hand,
the tractability of ABD($B$, $\mathcal{M}$) relies on Gaussian elimination. This method
fails when restricting the hypotheses to be positive, and there is no obvious
alternative.

7 Counting complexity 

We now turn to the complexity of counting. We focus on the case where the
manifestation is a single positive literal. For the symmetric abduction problem
we are interested in counting the number of full explanations, #ABD($B$, PQ). For
positive abduction two counting problems commonly arise: either to count all
positive explanations, denoted by #P-ABD($B$, PQ); or to count only the
subset-minimal explanations, denoted by #-$\subseteq$-P-ABD($B$, PQ). Our first result
in this section is the complete classification of #ABD($B$, PQ).

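Theorem 7.1. Let $B$ be a finite set of Boolean functions. Then the counting
problem of symmetric abduction for propositional $B$-formulae with a positive
literal manifestation, #ABD($B$, PQ), is

1. #$\cdot$coNP-complete if $\mathsf{S}_{02} \subseteq [B]$ or $\mathsf{S}_{12} \subseteq [B]$ or $\mathsf{D}_1 \subseteq [B]$,

2. #P-complete if $\mathsf{V}_2 \subseteq [B] \subseteq \mathsf{M}$ or $\mathsf{S}_{10} \subseteq [B] \subseteq \mathsf{M}$ or $\mathsf{D}_2 \subseteq [B] \subseteq \mathsf{M}$,

3. in FP in all other cases.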

Proof. The #$\cdot$coNP-membership for #ABD($B$, PQ) follows from the fact that
checking whether a set of literals is indeed an explanation for an abduction
problem is in $\mathrm{P}^{\mathrm{NP}} = \Delta^p_2$ and from the equality
#$\cdot\Delta^p_2$ = #$\cdot$coNP, see [lIV!)o].

We show #$\cdot$coNP-hardness by giving a parsimonious reduction from the
following #$\cdot$coNP-complete problem: Count the number of satisfying assignments
of $\psi(x_1, \ldots, x_n) = \forall y_1 \cdots \forall y_m\, \varphi(x_1, \ldots, x_n, y_1, \ldots, y_m)$,
where $\varphi$ is a DNF-formula (see, e.g., [DHK05]). Let
$x'_1, \ldots, x'_n, r_1, \ldots, r_n, t$ and $q$ be fresh, pairwise distinct
propositions. We define the propositional abduction problem
$\mathcal{P} = (\Gamma, A, q)$ as follows:

$\Gamma := \{x_i \to r_i,\ x'_i \to r_i,\ \neg x_i \vee \neg x'_i \mid 1 \le i \le n\}$
    $\cup\ \{\varphi \to t\} \cup \{\bigwedge_{i=1}^{n} r_i \wedge t \to q\}$,

$A := \{x_1, \ldots, x_n\} \cup \{x'_1, \ldots, x'_n\}$.

Observe that the manifestation $q$ occurs only in the formula
$\bigwedge_{i=1}^{n} r_i \wedge t \to q$. This together with the formulae
$x_i \to r_i$, $x'_i \to r_i$, $\neg x_i \vee \neg x'_i$, $1 \le i \le n$, enforces
that every full explanation of $\mathcal{P}$ has to select for each $i$ either $x_i$ and
$\neg x'_i$, or $\neg x_i$ and $x'_i$. By this the value of $x'_i$ is fully determined by
the value of $x_i$ and is its dual. Moreover, it is easy to see that there is a
one-to-one correspondence between the models of $\psi$ and the full explanations
of $\mathcal{P}$.

Observe that since the reductions in Lemma 3.1 are parsimonious, we can
suppose w.l.o.g. that $B$ contains the two constants 1 and 0. Therefore, it suffices
to consider the case $[B] = \mathsf{BF}$. Suppose that $\varphi = \bigvee_{i \in I} t_i$ and let
$\Gamma'$ be the set of formulae obtained by replacing
$\varphi \to t \equiv \neg(\bigvee_{i \in I} t_i) \vee t$ by the set of
clauses $\{\neg t_i \vee t \mid i \in I\}$. Then $\Gamma'$ is a set of disjunctions of
literals, whose size is polynomially bounded by $|\Gamma|$. Hence, by the
associativity of $\vee$, $\Gamma'$ can be transformed in logarithmic space into an
equivalent set of $B$-formulae. This provides a parsimonious reduction from the
above #$\cdot$coNP-complete problem to #ABD($B$, PQ).

Let us now consider the #P-complete cases. When $[B] \subseteq \mathsf{M}$, checking whether
a set of literals $E$ is an explanation for an abduction problem with $B$-formulae
is in P (see Proposition 4.5). This proves membership in #P. For the hardness
result, it suffices to consider the case $[B] = \mathsf{V}_2$, because the reduction
provided in Lemma 3.1 is parsimonious and $\mathsf{V}_2 \subseteq [\mathsf{S}_{10} \cup \{1\}]$.
We provide a Turing reduction from the problem #POSITIVE-2-SAT, which is known
to be #P-complete [Val79]. Let $\varphi = \bigwedge_{i=1}^{k} (p_i \vee q_i)$ be an
instance of this problem, where $p_i$ and $q_i$ are propositional variables from the
set $X = \{x_1, \ldots, x_n\}$. Let $q$ be a fresh proposition. Define the
propositional abduction problem $\mathcal{P} = (\Gamma, A, q)$ as follows:

$\Gamma := \{p_i \vee q_i \vee q \mid 1 \le i \le k\}$,   $A := \{x_1, \ldots, x_n\}$.

It is easy to check that the number of satisfying assignments for $\varphi$ is equal
to $2^n - \#\mathrm{Sol}(\mathcal{P})$. Finally, since $\vee \in [B] = \mathsf{V}_2$,
$\mathcal{P}$ can easily be transformed in logarithmic space into an ABD($B$, PQ)-instance.
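
The counting identity used in this reduction can be verified by enumeration on small formulae. The following sketch is an illustration rather than part of the proof: a positive 2-CNF is encoded as a list of index pairs `(p, r)` over variables `0..n-1`, and the function names are hypothetical. A full explanation fixes every $x_i$, so only the fresh proposition $q$ remains free.

```python
from itertools import product


def count_models(clauses, n):
    """Number of assignments to x_0..x_{n-1} satisfying every clause (p OR r)."""
    return sum(
        all(bits[p] or bits[r] for p, r in clauses)
        for bits in product([False, True], repeat=n)
    )


def count_full_explanations(clauses, n):
    """Number of full explanations of (Gamma, A, q), where
    Gamma = {p OR r OR q : (p, r) in clauses} and A = {x_0, ..., x_{n-1}}."""
    count = 0
    for bits in product([False, True], repeat=n):  # one literal per x_i
        def gamma(q_value):
            return all(bits[p] or bits[r] or q_value for p, r in clauses)

        consistent = gamma(False) or gamma(True)  # Gamma AND E is satisfiable
        entails_q = not gamma(False)              # Gamma AND E AND NOT q is unsatisfiable
        if consistent and entails_q:
            count += 1
    return count
```

For the formula $(x_0 \vee x_1) \wedge (x_1 \vee x_2)$, encoded as `[(0, 1), (1, 2)]` with `n = 3`, the two counts are 5 and 3, which sum to $2^3$ as the identity predicts.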

As for the tractable cases, the clones $\mathsf{E}$ and $\mathsf{N}$ are easy; and finally, for
$[B] \subseteq \mathsf{L}$, the number of full explanations is polynomial time computable
according to [HP07, Theorem 8]. □

Turning to positive abduction the #P-complete cases vanish, while for the 
L-clones the exact complexity remains open. 

Theorem 7.2. Let $B$ be a finite set of Boolean functions. Then both counting
problems of positive abduction for propositional $B$-formulae with a positive
literal manifestation, #P-ABD($B$, PQ) and #-$\subseteq$-P-ABD($B$, PQ), are

1. #$\cdot$coNP-complete if $\mathsf{S}_{02} \subseteq [B]$ or $\mathsf{S}_{12} \subseteq [B]$ or $\mathsf{D}_1 \subseteq [B]$,

2. in #P if $\mathsf{L}_2 \subseteq [B] \subseteq \mathsf{L}$,

3. in FP in all other cases.

Proof. The #$\cdot$coNP-membership follows analogously to the proof of Theorem 7.1.
For the hardness, the same reduction as in the proof of Theorem 7.1 works: there
is a one-to-one correspondence between full explanations and purely positive
explanations. Moreover, all explanations are incomparable and hence subset-minimal.

For the affine case membership in #P follows from the NP-membership of 
the corresponding decision problem. 

The remaining cases are encompassed by $[B] \subseteq \mathsf{M}$. In this case a slight
strengthening of Lemma 5.1 is easily seen: Let $(\Gamma, A, q)$ be an instance. Then
$A$ is an explanation if and only if all subsets of $A$ are explanations. Hence for
#P-ABD($B$, PQ) the number of solutions is either 0 or $2^{|A|}$ (all subsets), while
for #-$\subseteq$-P-ABD($B$, PQ) it is either 0 or 1 (the empty set). We obtain
membership in FP, because for monotonic formulae deciding whether $A$ is an
explanation can be done in LOGSPACE. □

We note that for manifestations represented as terms, clauses, or B-formulae, 
most of the classifications of the corresponding counting problems can be easily 
derived from the above results; the exceptions to this are #Abd(M, £(5)) and 
some cases satisfying $[B \cup \{1\}] = \mathsf{L}$, whose exact complexity remains open.

8 Concluding Remarks 

In this paper we studied the decision and counting complexity of symmetric and
positive propositional abduction from a knowledge base being represented as
sets of $B$-formulae, for all possible finite sets $B$ of Boolean functions. We gave
a detailed picture of the complexity of abduction in considering restrictions on 
both manifestations and hypotheses. Thus our results highlight the sources of 
intractability, identify fragments of lower complexity, and may help to identify 
candidates for parameters in the study of parameterized complexity of abduction. 

Our restrictions on the hypotheses covered only the symmetric and the posi- 
tive case. One can as well define negative abduction, where explanations consist 
of negative literals only; or non-symmetric abduction, where explanations are
formed upon a given set of literals, which is not demanded to be closed under 
complement (in contrast to S-Abd). However, results not mentioned herein in- 
dicate that the classifications of these variants are easily seen to be identical to 
S-Abd (except for the L-clones). 

It is worth noticing that, with the exception of the clones between L2 and L, 
whenever the abduction problem is tractable for some clone, it is trivial. In con- 
trast, tractability for the clones between L2 and L relies on Gaussian elimination, 
which fails when we restrict explanations to be positive. Determining the com- 
plexity of positive abduction for the clone L with manifestations represented by 
L-formulae might hence prove to be a challenging task (note that a similar case, 
the circumscriptive inference of an affine formula from a set of affine formulae, 
remained unclassified in [Tlio09]). 
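Name | Definition | Base
$\mathsf{BF}$ | All Boolean functions | $\{x \wedge y, \neg x\}$
$\mathsf{R}_0$ | $\{f \mid f$ is 0-reproducing$\}$ | $\{x \wedge y, x \oplus y\}$
$\mathsf{R}_1$ | $\{f \mid f$ is 1-reproducing$\}$ | $\{x \vee y, x \oplus y \oplus 1\}$
$\mathsf{R}_2$ | $\mathsf{R}_0 \cap \mathsf{R}_1$ | $\{\vee, x \wedge (y \oplus z \oplus 1)\}$
$\mathsf{M}$ | $\{f \mid f$ is monotonic$\}$ | $\{x \vee y, x \wedge y, 0, 1\}$
$\mathsf{M}_1$ | $\mathsf{M} \cap \mathsf{R}_1$ | $\{x \vee y, x \wedge y, 1\}$
$\mathsf{M}_0$ | $\mathsf{M} \cap \mathsf{R}_0$ | $\{x \vee y, x \wedge y, 0\}$
$\mathsf{M}_2$ | $\mathsf{M} \cap \mathsf{R}_2$ | $\{x \vee y, x \wedge y\}$
$\mathsf{S}^n_0$ | $\{f \mid f$ is 0-separating of degree $n\}$ | $\{x \to y, \mathrm{dual}(h_n)\}$
$\mathsf{S}_0$ | $\{f \mid f$ is 0-separating$\}$ | $\{x \to y\}$
$\mathsf{S}^n_1$ | $\{f \mid f$ is 1-separating of degree $n\}$ | $\{x \wedge \neg y, h_n\}$
$\mathsf{S}_1$ | $\{f \mid f$ is 1-separating$\}$ | $\{x \wedge \neg y\}$
$\mathsf{S}^n_{02}$ | $\mathsf{S}^n_0 \cap \mathsf{R}_2$ | $\{x \vee (y \wedge \neg z), \mathrm{dual}(h_n)\}$
$\mathsf{S}_{02}$ | $\mathsf{S}_0 \cap \mathsf{R}_2$ | $\{x \vee (y \wedge \neg z)\}$
$\mathsf{S}^n_{01}$ | $\mathsf{S}^n_0 \cap \mathsf{M}$ | $\{\mathrm{dual}(h_n), 1\}$
$\mathsf{S}_{01}$ | $\mathsf{S}_0 \cap \mathsf{M}$ | $\{x \vee (y \wedge z), 1\}$
$\mathsf{S}^n_{00}$ | $\mathsf{S}^n_0 \cap \mathsf{R}_2 \cap \mathsf{M}$ | $\{x \vee (y \wedge z), \mathrm{dual}(h_n)\}$
$\mathsf{S}_{00}$ | $\mathsf{S}_0 \cap \mathsf{R}_2 \cap \mathsf{M}$ | $\{x \vee (y \wedge z)\}$
$\mathsf{S}^n_{12}$ | $\mathsf{S}^n_1 \cap \mathsf{R}_2$ | $\{x \wedge (y \vee \neg z), h_n\}$
$\mathsf{S}_{12}$ | $\mathsf{S}_1 \cap \mathsf{R}_2$ | $\{x \wedge (y \vee \neg z)\}$
$\mathsf{S}^n_{11}$ | $\mathsf{S}^n_1 \cap \mathsf{M}$ | $\{h_n, 0\}$
$\mathsf{S}_{11}$ | $\mathsf{S}_1 \cap \mathsf{M}$ | $\{x \wedge (y \vee z), 0\}$
$\mathsf{S}^n_{10}$ | $\mathsf{S}^n_1 \cap \mathsf{R}_2 \cap \mathsf{M}$ | $\{x \wedge (y \vee z), h_n\}$
$\mathsf{S}_{10}$ | $\mathsf{S}_1 \cap \mathsf{R}_2 \cap \mathsf{M}$ | $\{x \wedge (y \vee z)\}$
$\mathsf{D}$ | $\{f \mid f$ is self-dual$\}$ | $\{(x \wedge \neg y) \vee (x \wedge \neg z) \vee (\neg y \wedge \neg z)\}$
$\mathsf{D}_1$ | $\mathsf{D} \cap \mathsf{R}_2$ | $\{(x \wedge y) \vee (x \wedge \neg z) \vee (y \wedge \neg z)\}$
$\mathsf{D}_2$ | $\mathsf{D} \cap \mathsf{M}$ | $\{(x \wedge y) \vee (x \wedge z) \vee (y \wedge z)\}$
$\mathsf{L}$ | $\{f \mid f$ is affine$\}$ | $\{x \oplus y, 1\}$
$\mathsf{L}_0$ | $\mathsf{L} \cap \mathsf{R}_0$ | $\{x \oplus y\}$
$\mathsf{L}_1$ | $\mathsf{L} \cap \mathsf{R}_1$ | $\{x \oplus y \oplus 1\}$
$\mathsf{L}_2$ | $\mathsf{L} \cap \mathsf{R}_2$ | $\{x \oplus y \oplus z\}$
$\mathsf{L}_3$ | $\mathsf{L} \cap \mathsf{D}$ | $\{x \oplus y \oplus z \oplus 1\}$
$\mathsf{V}$ | $\{f \mid f$ is a disjunction of variables or constants$\}$ | $\{x \vee y, 0, 1\}$
$\mathsf{V}_0$ | $\mathsf{V} \cap \mathsf{R}_0$ | $\{x \vee y, 0\}$
$\mathsf{V}_1$ | $\mathsf{V} \cap \mathsf{R}_1$ | $\{x \vee y, 1\}$
$\mathsf{V}_2$ | $\mathsf{V} \cap \mathsf{R}_2$ | $\{x \vee y\}$
$\mathsf{E}$ | $\{f \mid f$ is a conjunction of variables or constants$\}$ | $\{x \wedge y, 0, 1\}$
$\mathsf{E}_0$ | $\mathsf{E} \cap \mathsf{R}_0$ | $\{x \wedge y, 0\}$
$\mathsf{E}_1$ | $\mathsf{E} \cap \mathsf{R}_1$ | $\{x \wedge y, 1\}$
$\mathsf{E}_2$ | $\mathsf{E} \cap \mathsf{R}_2$ | $\{x \wedge y\}$
$\mathsf{N}$ | $\{f \mid f$ depends on at most one variable$\}$ | $\{\neg x, 0, 1\}$
$\mathsf{N}_2$ | $\mathsf{N} \cap \mathsf{D}$ | $\{\neg x\}$
$\mathsf{I}$ | $\{f \mid f$ is a projection or a constant$\}$ | $\{\mathrm{id}, 0, 1\}$
$\mathsf{I}_0$ | $\mathsf{I} \cap \mathsf{R}_0$ | $\{\mathrm{id}, 0\}$
$\mathsf{I}_1$ | $\mathsf{I} \cap \mathsf{R}_1$ | $\{\mathrm{id}, 1\}$
$\mathsf{I}_2$ | $\mathsf{I} \cap \mathsf{R}_2$ | $\{\mathrm{id}\}$

Table 1. The list of all Boolean clones with definitions and bases, where
$h_n := \bigvee_{i=1}^{n+1} \bigwedge_{j=1, j \neq i}^{n+1} x_j$ and
$\mathrm{dual}(f)(a_1, \ldots, a_n) = \neg f(\neg a_1, \ldots, \neg a_n)$.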
[Figure 1: Post's lattice. Legend: NP-c.; $\oplus$L-h./$\in$ P; $\in$ L.]

Fig. 1. Post's lattice showing the complexity of the symmetric abduction
problem ABD($B$, $\mathcal{M}$) for all sets $B$ of Boolean functions and the most
interesting restrictions $\mathcal{M}$ of the manifestations. In the legend, L abbreviates
LOGSPACE and the suffixes "-h" and "-c" indicate hardness and completeness
respectively.









[Figure 2: Post's lattice, shown in panels for P-ABD($B$, Q), P-ABD($B$, C), and
P-ABD($B$, T). Legend: $\Sigma^p_2$-c.; coNP-c.; NP-c.; $\oplus$L-h./$\in$ P; $\in$ L.]

Fig. 2. Post's lattice showing the complexity of the positive abduction problem
P-ABD($B$, $\mathcal{M}$) for all sets $B$ of Boolean functions and the most interesting
restrictions $\mathcal{M}$ of the manifestations. In the legend, L abbreviates LOGSPACE
and the suffixes "-h" and "-c" indicate hardness and completeness respectively.